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Preface 


The technological value of computational fluid dynamics has become undis- 
puted. A capability has been established to compute flows that can be in- 
vestigated experimentally only at reduced Reynolds numbers, or at greater 
cost, or not at all, such as the flow around a space vehicle at re-entry, or 
a loss-of-coolant accident in a nuclear reactor. Furthermore, modern com- 
putational fluid dynamics has become indispensable for design optimization, 
because many different configurations can be investigated at acceptable cost 
and in short time. A distinguishing feature of the present state of computa- 
tional fluid dynamics is, that large commercial computational fluid dynamics 
computer codes have arisen, and found widespread use in industry. The days 
that a great majority of code users were also code developers are gone. This 
attests to the importance and a certain degree of maturity of computational 
fluid dynamics as an engineering tool. At the same time, this creates a need 
to go back to basics, and to disseminate the basic principles to a wider audi- 
ence. It has been observed on numerous occasions, that even simple flows are 
not correctly predicted by advanced computational fluid dynamics codes, if 
used without sufficient insight in both the numerics and the physics involved. 
The present book aims to elucidate the principles of computational fluid dy- 
namics. With a variation on Lamb’s preface to his classic Hydrodynamics, 
owing to the elaborate nature of some of the methods of computational fluid 
dynamics, it has not always been possible to fit an adequate account of them 
into the frame of this book. 


When technology progresses from the pre-competitive to the competitive 
stage, unavoidably, something like an information stop sets in. To protect 
investments, and because of the relatively long learning curve to be traversed 
in order to become familiar with a large computer code, a certain sluggishness 
of change makes itself felt. These consequences of the widespread distribu- 
tion of large computational fluid dynamics codes needs to be counteracted 
by the dynamics of unencumbered scientific enquiry, not to pursue change 
for change’s sake, but because much improvement seems feasible. Therefore 
I hope the book will be helpful not only to users of computational dynamics 
codes, but also to researchers in the field. 


VI 


The book has grown out of graduate courses for doctoral students and prac- 
ticing engineers, held under the auspices of the J.M. Burgers Center, the 
national inter-university graduate school for fluid dynamics in The Nether- 
lands. I expect teachers of advanced courses of computational fluid dynamics 
courses will find this a useful book. For an introductory course the book 
seems too advanced, but I have found selected material from the manuscript 
useful in teaching an introductory undergraduate course. 


Other relatively recent introductions to the subject of computational fiuid 
dynamics that the reader will find useful are Ferziger and Perić (1996), 
Fletcher (1988), Hirsch (1988), Hirsch (1990), Peyret and Taylor (1985), 
Roache (1998a), Shyy (1994), Sod (1985), Tannehill, Anderson, and Pletcher 
(1997), Versteeg and Malalasekera (1995), Wendt (1996). The two volumes by 
Hirsch give an especially wide coverage. The present book differs from these 
works in the following respects. More mathematical and numerical analy- 
sis is given, but the mathematical background of the reader is assumed not 
to go beyond what physicists and engineers are generally familiar with. The 
maximum principle for differential equations and numerical schemes gets gen- 
erous attention, in order to put discussions of spurious ‘wiggles’, accuracy of 
schemes on nonuniform grids, and accuracy of numerical boundary condi- 
tions on a firm footing. Singular perturbation theory is introduced to predict 
qualitative features of the flow, to which numerical methods can be adapted 
for better accuracy and efficiency. In particular, singular perturbation the- 
ory is used with a fair amount of rigor to demonstrate convincingly how it 
is possible to achieve accuracy and computing cost uniform in the Reynolds 
number, showing that a ‘numerical windtunnel’ that operates at arbitrarily 
high Reynolds number is feasible, notwithstanding the effect of ‘numerical 
viscosity’. Much attention is given to the principles and the application of 
von Neumann stability analysis, giving useful stability conditions, some of 
them new, for many schemes used in practice. Godunov’s order barrier and 
how to overcome it by slope-limited schemes is discussed extensively. The 
theory of scalar conservation laws including the nonconvex case is treated. 
Distributive iteration is used as a unifying framework for describing itera- 
tive methods for the incompressible Navier-Stokes equations. The principles 
of Krylov subspace and multigrid methods for efficient solution of the large 
sparse algebraic systems that arise are introduced. Much attention is given to 
the complications brought about by geometric complexity of the flow domain, 
including an introduction to tensor analysis. A chapter on unified methods to 
compute incompressible and compressible flows is included. In order to help 
the reader along who wants to delve deeper and to quickly reach the current 
research frontier, references to more advanced literature are provided. 


Errata and MATLAB software related to a number of examples discussed in 
the book my be obtained via the author’s website, to be found at 
ta.twi.tudelft.nl/nw/users/wesseling 


VII 


Combining the writing of a textbook of this size with the daily tasks of a 
university professor was not always easy, and would have been impossible 
without the support of the numerical team, and in particular our secretary 
Tatiana Tijanova. Her dedication, love of perfection and capability to cope 
with repeated stress were of vital importance for keeping the manuscript or- 
ganized, and finally bringing it into publishable form. I am indebted to dr. 
C. Vuik for advice on Chap. 7, to professor G.S. Stelling for checking up on 
Chap. 8, and to professor F.T.M. Nieuwstadt for casting a critical eye on 
what I wrote about turbulence. The enthusiasm of the students in the grad- 
uate courses on computational fluid dynamics of the J.M. Burgers Center, 
and the cooperation with my fellow teacher professor A.E.P. Veldman, were 
inspiring and stimulating. The moral support of my wife Tineke was and re- 
mains invaluable. 


Delft, June 2000 P. Wesseling 
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1. The basic equations of fluid dynamics 


1.1 Introduction 


Fluid dynamics is a classic discipline. The physical principles governing the 
flow of simple fluids and gases, such as water and air, have been understood 
since the times of Newton. Sect. IX of the second book of Newton’s Principia 
starts with what came to be known as the Newtonian stress hypothesis : 
“The resistance arising from the want of lubricity in the parts of a fluid, is, 
other things being equal, proportional to the velocity with which the parts of 
the fluid are separated from another”. This hypothesis is followed by Propo- 
sition LI, in which the flow generated by a rotating cylinder in an unbounded 
medium is considered. The period of the orbit of a fluid particle is found to 
be proportional to the distance r from the cylinder axis. This is not correct. 
The source of the error is that the master balances force instead of torque; 
this may be of some consolation to beginning students who find mechan- 
ics difficult. The closing remark “All of this can be tested in deep stagnant 
water” must be taken with a grain of salt. Newton was more interested in 
celestial mechanics than in fluid dynamics. His aim was to test Descartes’s 
vortex theory of planetary motion, which would gain credibility if the pe- 
riod of the orbit of a particle in this flow would be proportional to 73/2. in 
fact, it is proportional to r*. The mathematical formulation of the laws that 
govern the dynamics of fluids has been complete for a century and a half. 
In the nineteenth century and the beginning of the twentieth, eminent scien- 
tists and engineers were drawn to the subject, and gave it clarity, unification 
and elegance, as exemplified in the classic work of Lamb (1945), that first 
appeared in 1879. In the preface to the 1932 edition Lamb writes, that the 
subject has in recent years received considerable developments, classic fluid 
dynamics having a widening field of practical applications. This has remained 
true ever since, especially because in the last forty years or so classic fluid 
dynamics finds itself in the company of computational fluid dynamics. This 
new discipline still lacks the elegance and unification of its classic counter- 
part, and is in a state of rapid development, so that we can do no more than 
give a glimpse of its current status. But first, we take a look at classic fluid 
dynamics. 
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Continuum hypothesis 


The dynamics of fluids is governed by the conservation laws of classical 
physics, namely conservation of mass, momentum and energy. From these 
laws partial differential equations are derived and, under appropriate cir- 
cumstances, simplified. It is customary to formulate the conservation laws 
under the assumption that the fluid is a continuous medium (continuum hy- 
pothesis). Physical properties of the flow, such as density and velocity can 
then be described as time-dependent scalar or vector fields on RË, for exam- 
ple p(t, æ) and u(t, æ). 


For the flowing medium we restrict ourselves here to gases and liquids. More 
general media, such as mixtures of gases and liquids (multiphase flows), will 
not be considered. For a liquid the continuum hypothesis is always satis- 
fied in practice. A gas satisfies the continuum hypothesis (to a sufficient 
degree) if K « 1, with K the Knudsen number, defined as K = A/L, with 
A the mean free path and L the length scale of the flow phenomenon under 
study. Consider for example flow over a flat plate with free-stream velocity 
V (Fig. 1.1). It is known that at the plate a boundary layer is generated, in 


= — - 
= = 
— = 


— 
— 
— ae am m 


Fig. 1.1. Flow over a flat plate 


which the velocity changes from zero to V. The thickness ô of this bound- 
ary layer is the relevant length scale. If the fluid is air at room temperature 
then \ = 0.4 um. With V = 1 m/s, experiment and boundary layer theory 
tell us that ô = 2.5 cm at 0.5 m downstream of the leading edge, so that 
here K = 0.16 x 1074. Hence, at this location momentum exchange due to 
friction takes place over a length scale of about 60,000 mean free paths, and 
the continuum hypothesis is very well satisfied. Perhaps this would not be so 
quite near the leading edge of an extremely sharp flat plate, but that need 
not concern us here. We will throughout assume the continuum hypothesis, 
and the fiowing medium, be it gas or liquid, will often be called the fluid. The 
most common situation in technology where the continuum hypothesis has 
to be abandoned is the flow of very rarefied gases. Such a flow regime occurs 
at a certain stage of atmospheric re-entry of space vehicles. Unexpectedly 
perhaps, for flows of the interstellar medium often K < 1, because of the size 
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of the galactic length scale, so that the continuum hypothesis can be safely 
applied there. 


Lagrangean and Eulerian formulation 


The continuum hypothesis enables us to speak of the properties of a flow at 
a point in space, and of the physical properties of an infinitesimally small 
volume element of the fluid, to which we shall refer for brevity as a material 
particle . A flow can be described exhaustively by specification of the physical 
properties of each material particle as a function of time. This kind of speci- 
fication of a flow is called the Lagrangean formulation. Alternatively, a flow 
may be described by specification of the time history of the flow properties at 
every fixed point of the domain. This is called the Eulerian formulation. The 
second formulation is usually more accessible for analysis and computation 
than the first, but sometimes the Lagrangean formulation may be preferable, 
for example when fluid interfaces have to be tracked. In most cases we ask 
for flow properties at fixed locations, such as the pressure at a wall, and this 
information is provided directly by the Eulerian formulation, to which we will 
adhere throughout this book. The Eulerian and Lagrangean points of view 
meet in the transport theorem, to be discussed in Sect. 1.3. 


Selection of topics 


The reader is assumed to be familiar with the principles of fluid dynamics, 
of vector analysis, of numerical linear algebra and of the numerical analysis 
of partial differential equations. 


Fluid dynamics is a vast discipline, utilizing many different mathematical 
models. As the Mach number M (to be introduced later) varies, we encounter 
incompressible (M < 1), subsonic (0 < M < 1), transonic (M = 1), super- 
sonic (M > 1) and hypersonic (M >> 1) flow. In hypersonic fiow, chemical 
processes taking place in the fluid have to be accounted for, giving rise to 
the discipline of aerothermochemistry. Multiphase flows play a large role in 
chemical engineering and reservoir engineering. Flows in porous media are 
governed by the Darcy equations. In hydraulic engineering the shallow-water 
equations are predominant. In ship hydrodynamics the free surface (water-air 
interface) often has to be accounted for. Capillary forces may be important. 
As the Reynolds number (to be introduced shortly) increases, transition from 
laminar to turbulent flow occurs, giving rise to a plethora of more or less 
semi-empirical turbulence models. Rotation causes special effects, important 
in oceanography, and in atmospheric and planetary fluid dynamics. We have 
made a selection of topics. The book is focussed on the incompressible and 


compressible Navier-Stokes equations, restricting ourselves to M A 2, thus 
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catering mainly to the needs of industrial and environmental fluid dynamics 
and aeronautics. We will also pay attention to the shallow-water equations. 
Fortunately, many of the underlying principles carry over to cases not treated. 
In particular, a thorough understanding of the analytical and computational 
aspects of the comparatively simple convection-diffusion equation gives valu- 
able insight in more complex models. Therefore this equation will receive 
much attention. 


Although most practical flows are turbulent, we restrict ourselves here to 
laminar flow, because this book is on numerics only. Turbulence modeling is 
a vast subject in itself, that is briefly discussed in Sect. 1.13, where pointers 
to the literature are given for further study. The numerical principles uncov- 
ered for the laminar case carry over to the turbulent case. To facilitate this, 
viscosity is usually assumed variable. 


Fluid dynamics is governed by partial differential equations. These may be 
solved numerically by finite difference, finite volume, finite element and spec- 
tral methods. In engineering applications, finite difference and finite volume 
methods are predominant. In order to limit the scope of this work, we will 
confine ourselves to finite difference and finite volume methods. 


Since in computational fluid dynamics mathematical modeling aspects in- 
variably play an important role, we devote the remainder of this chapter to a 
thorough derivation of the basic equations of fluid dynamics and their main 
simplifications. Of course, the subject cannot be adequately reviewed in a 
single chapter. For a more extensive treatment, see Batchelor (1967), Chorin 
and Marsden (1979), Kreiss and Lorenz (1989), Lamb (1945), Landau and 
Lifshitz (1959), Sedov (1971), Zucrow and Hoffman (1976), Zucrow and Hoff- 
man (1977). A brief introduction to the history of the subject, with references 
to further literature, is given by Eberle, Rizzi, and Hirschel (1992). 


Good starting points for exploration of the Internet for material related to 
computational fluid dynamics are the following websites: 
www.cfd-online.com/ 
www. princeton. edu/” gasdyn/fluids.html 
and the ERCOFTAC (European Research Community on Flow, Turbulence 
and Combustion) site: 
imhefwww.epfl.ch/ERCOFTAC/ 


Readers well-versed in fluid dynamics may skip the remainder of this chapter, 
perhaps after taking note of the notation introduced in the next section. But 
those less familiar with this discipline will find it useful to continue with the 
present chapter. 
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1.2 Vector analysis 


Cartesian tensor notation 


The basic equations will be derived in a right-handed Cartesian coordi- 
nate system (£1, £2,..., £a) with d the number of space dimensions. Bold- 
faced lower case Latin letters denote vectors, for example, # = (£1, 2,..., £a). 
Greek letters denote scalars. In Cartesian tensor notation, which we shall of- 
ten use, differentiation is denoted as follows: 


Px = 06/02 . 


Greek subscripts refer to coordinate directions, and the summation conven- 
tion is used: summation takes place over Greek indices that occur twice in a 
term or product, for example: 


d d 
Uata = 5 Uata, Qaa = X 84r? . 
a=1 


asl 


We will also use vector notation, instead of the subscript notation just ex- 
plained, and may write divu, if this is more elegant or convenient than 
the tensor equivalent ua; and sometimes we write grad ¢ for the vector 


(1, 0,2, $3) 


Divergence theorem 


We need the following fundamental theorem: 


Theorem 1.2.1. For any volume V C R° with piecewise smooth closed sur- 
face S and any differentiable scalar field ġ we have 


J badV = J bngdS , 
VY 5 


where n is the outward unit normal on S. 


For a proof, see for example Aris (1962). 


A direct consequence of this theorem is: 


Theorem 1.2.2. (Divergence theorem). 
For any volume V C R¢ with piecewise smooth closed surface S and any 
differentiable vector field u we have 


J ävuav= f unas, 


V 5 


where n is the outward unit normal on 5S. 
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Proof. Apply Theorem 1.2.1 with ġa = ua, @ = 1,2,...,d successively and 
add. o 


A vector field satisfying divu = 0 is called solenoidal. 


Stokes’s theorem 


The curl of a vector field is defined by 


U3,2 — U2,3 
curlu = u13 — 43,1 
U21 — U1,2 


That is, the x1-component of the vector curlu is uz 2—u2,3, etc. Often, the curl 
is called rotation, and a vector field satisfying curl = 0 is called irrotational. 


Theorem 1.2.3. (Stokes’s theorem) 
Let C be a closed curve and let S be any piecewise smooth surface bounded 
by C. Then for every differentiable vector field u we have 


fude = |f cuiu) -nas 
C 5 


with dæ the curve element along C and n the unit normal to S, right-handed 
with respect to the direction of traversing C. 


For a proof, see for example Aris (1962). 


Potential flow 


Using Stokes’s theorem it can be shown (cf. Aris (1962)) that if a vector field 
u satisfies curl = 0 there exists a scalar field p such that 


u = grady (1.1) 
(Or ta = P,a) The scalar ọ is called the potential, and flows with velocity 


field u satisfying (1.1) are called potential flows or irrotational flows (since 
curl grady = 0). 


Helmholtz and Clebsch representations 


Useful representations of vector fields are given by the following theorems. 
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Theorem 1.2.4. (Helmholtz representation) 
For any bounded continuous vector field u which vanishes at infinity there 
exist a scalar field p and a solenoidal vector field b such that 


u = grady + curlb. 


Theorem 1.2.5. (Clebsch representation) 
For any bounded continuous vector field u which vanishes at infinity there 
erist scalar fields 6, and 0 such that 


u = grady + Ogrady . (1.2) 


For proofs, see Aris (1962). The scalars in (1.2) are often called Clebsch or 
Monge potentials. 


The volume element 


The outer product of two vectors u and v is defined by 

u x v = [uzv — u3v2)e(1) + (uav — U103)€(2) + (u1v2 — U2v1)e(3) ; 
with e(a) the unit vector in the £q-direction. Note that u x v = —v x u. 
The permutation symbol €apy is defined as follows: 


0, if any two of a, 8, y are the same , 
Eapy = 1, if afy is an even permutation of 123 , 
—1, if a@y is an odd permutation of 123 . 


An even permutation is in order of increasing magnitude, with 1 coming 
after 3, for example 231; the odd case is the reverse, such as 213. Using the 
permutation symbol, the outer product can be written in subscript notation 
as 


UX V = EgBylatpe(y) - 


The volume V of the parallelepiped spanned by the vectors u, v and w (taken 
in right-handed order) is given by (the proof is left as Exercise 1.2.3) 


V=u-(v Xx w) = Enpytatpvy . (1.3) 


Let (y1, y2, y3) be an arbitrary right-handed curvilinear coordinate system, 
related to the Cartesian coordinates (z1, £2, #3) by 


Lea = Lali; Y2, 93) - 


The incremental change in position vector associated with an incremental 
change dya iN Ya 18 
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da'*) = Fn (no summation) . (1.4) 


From (1.3) and (1.4) it follows that the volume element is given by 
dV = dæ . (dæ x dæ®)) = 


ra GET: Ox 
= Eapy On By Jy = Jdydyzdyz , (1.5) 
where 
J Ta Izp Oxy (1.6) 


=e —— . 
apy Oy Oye ðy3 


By inspection it is easily verified that the determinant of a 3 x 3 matrix A 
with elements agg is given by 


det (A) = Fapylq14p2443 « 
Hence, J in (1.6) is the determinant of a matrix A with elements 


OF a 


laf = By,’ 


so that A is recognized as the Jacobian matrix of the mapping (1.4); the 
quantity J is called the Jacobian. 


Two dimensions 


Two-dimensional versions of the above results are easily obtained by putting 
the third component and/or 0/0x3 equal to zero. For example, in two dimen- 
sions, 


curlu = U2,1 — U1,2- 


Often, w = u21— u1,2 can be safely handled as a scalar in a two-dimensional 
context, but in fact curls is a vector with only the x3-component not zero. 


Exercise 1.2.1. Prove Theorem 1.2.1 for the special case that V is the unit 
cube. 


Exercise 1.2.2. Show that curlz is solenoidal. 


Exercise 1.2.3. Show that c = a x b is perpendicular to a and b in right- 
handed direction, and that the magnitude |c| = |a|- |b| cos w, with w the 
angle between a and b. Prove (1.3). 
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1.3 The total derivative and the transport theorem 


Streamlines, particle paths and streaklines 


A streamline is a curve that is everywhere tangent to the velocity vector 
u(t,@) at a given time t. Hence, a streamline may be parametrized with a 
parameter s such that a streamline is a curve defined by 


dz/ds = u(t, æ). 


Let a(t, y) be the position of a material particle at time t > 0, that at time 
t = 0 had initial position y. The particle path that has been traced out by 
the particle is the curve æ(s, y), 0 < s < t. The particle path is related to 
the velocity field by 


Ox(s,y)/Os = u(s,æ), O<s<t. 


A streakline is defined as the curve on which lie all those particles that at 
some earlier point of time passed through a certain point of space, say y. 
This is the curve traced out by releasing a continuous stream of dye in the 
flow at location y. 


In general, streamlines, particle paths and streaklines are all different. But 
when the flow is stationary, i.e. the velocity field is time-independent, they 
coincide. 


The total derivative 


Obviously, the velocity field w(t, æ) of the flow satisfies 


u(t, æ) = ĉzt y) . (1.7) 


The time-derivative of a property ¢ of a material particle, called a mate- 
rial property (for example its temperature), is denoted by D¢/ Dt. This is 
called the total derivative. All material particles have some ¢, so ġ is defined 
everywhere in the flow, and is a scalar field g(t, æ). We have 

be _ ð 


Dt inaa glt elt y)) ’ (1.8) 


where the partial derivative has to be taken with y constant, since the total 
derivative tracks variation for a particular material particle. We obtain 


Dé _ a6 , ðralt,y) 3$ 


Di at ns 
By using (1.7) we get 
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The transport theorem 
A material volume V(t) is a volume of fluid that moves with the flow and 
consists permanently of the same material particles. 


Theorem 1.3.1. (Reynolds’s transport theorem) 
For any material volume V(t) and differentiable scalar field ¢ we have 


d ag . 

T J ọdV = J C +div ọu)dV . (1.9) 
V(t) V (t) 

For the proof we need the following lemma. 


Lemma 1.3.1. Let æ(t, y) describe the trajectory of a material particle, and 


ðJ , 
aa Jdiv u. (1.10) 


Proof. By expanding the determinant it is easy to see that 


ðJ 

ðt 
with Ja obtained from J by replacing the elements in row number a by their 
time derivatives. We have 


= Jit J+ J3, 














aag O ta Öua Oua Ty -u a 
ðt At Oye dys Or, dyg VV V 
Hence 
Ulay Uliyay? U1,yAy3 
Jı = a21 a22 az |= 
a31 a32 a33 
U1,1 U1,2 %1,3 G11 @12 G13 
0) 1 0 a21 G@a2 Q23 = ud. 
0 0 1 a31 @32 033 
Similarly, J2 = uz2,2J, J3 = u3,3J, which completes the proof. D 


Proof of Theorem 1.3.1: 

The initial position y and its position æ at time t of a material particle are 
related by æ(t, y). This provides a mapping between V(t) and V(0), that 
we can use to transform the integral over V(t) to an integral over V(0). In 
Yo-coordinates, the volume element is given by (1.5), so that, using (1.8) and 
(1.10), 
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d d . 
Ti J ġdV = Pr J ġdJdyidyzdy3 = J CET + ġJdivu)dyidyzdy3 . 
V(t) V (0) V (0) 


Transforming back to r,-coordinates, we obtain 


d _ Do . ð$ 
a J ġdV = f (Get bdivu)av = [ G+ divýu)av 
V(t) V(t) V(t) 
m 


A somewhat similar theorem applies to the rate of change of circulation 
around a material contour. The circulation around a closed curve C is defined 
as 


r(C)= $u-de, 


C 


with dæ a curve element. A material contour C(t) is a simple (2.e. non- 
intersecting) closed curve that consists of material particles. We will prove 


Theorem 1.3.2. (Rate of change of circulation). 
For any material contour C(t) and differentiable vector field u(t, æ) we have 


d Du 
— -dæ = — - dæ. . 
u - dæ f D2 (1.11) 
c(t) 


dt 
c(t) 


Proof. As in the proof of the preceding theorem, we transform the integral 
to an integral over the initial curve C’'(0): 


d d Oxu(t, y) 
_ . dæ = — OT dy, . 
$ u £ di f u(t, z(t, y)) Pya dya 


dt 
C(t) C(0) 


Since Ou(t, x(t, y))/Ot = Du/ Dt and 0*a(t, y)/dtdy, = ðu/Iya, this can 
be written as 
d 
— $ u-de- TE, duet f u Sidu = f FE de, 
ðt PYa 


OYa 
C(t) C{0) C(0) C(t) 





because $ u- ôu dy, = E $ ĉu dya = 0. 
c(0) co) 7 


We are now ready to formulate the conservation laws for mass, momentum 
and energy. 
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1.4 Conservation of mass 
The continuity equation 


The mass conservation law says that the rate of change of mass in an arbitrary 
material volume V(t) equals the rate of mass production in V(t). This can 
be expressed as 


© | nav = J odV , (1.12) 
V(t) V(t) 


where p(t, #) is the density of the material particle at time t and position æ, 
and o(t, æ) is the rate of mass production per volume. In practice, o Æ 0 only 
in multiphase flows, in which case (1.12) holds for each phase separately. We 
take o = 0, and use the transport theorem to obtain 


Op. _ 
J (a + divpu)dV =0. 


V(t) 


Since this holds for every V(t) the integrand must be zero: 


Op . _ 
g + tiveu=0. (1.13) 


This is the mass conservation law, also called the continuity equation. 


Incompressible flow 


An incompressible flow is a flow in which the density of each material particle 
remains the same during the motion: 


p(t, w(t, ¥)) = p(0, y) - (1.14) 
Hence 
Dp 
Dt 0 ‘ 
Because 


divpu = pdivu + uapa , 
it follows from the mass conservation law (1.13) that 


divu =0. (1.15) 
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This is the form that the mass conservation law takes for incompressible flow. 


Sometimes incompressibility is erroneously taken to be a property of the fluid 
rather than of the flow. But as we shall see later, compressibility depends only 
on the speed of the flow. If the magnitude of the velocity of the flow is of 
the order of the speed of sound in the fluid (~ 340 m/s in air at sea level at 
15°C, ~ 1.4 km/s in water at 15°C, depending on the amount of dissolved 
air) the flow is compressible; if the velocity is much smaller than the speed 
of sound, incompressibility is a good approximation. It is true that in liquids 
flow velocities anywhere near the speed of sound cannot normally be reached, 
due to the enormous pressures involved and the phenomenon of cavitation. 
But we will now discuss other effects that cause density variations. 


Incompressible flows with variable density 


Incompressibility does not imply p = constant. An example of incompressible 
flow with p # constant is the flow of water with salt dissolved in it with 
concentration c(æ, t). Neglecting diffusion, the amount of solute in a material 
particle is constant: 


c(t, a(t, y)) = c(0, y) - (1.16) 


The density is a function of c only. Therefore (1.14) is satisfied, and the flow is 
incompressible according to our definition. If diffusion is taken into account, 
the concentration is governed by the law of conservation of solute, given by 


De 1 1 
Ti = z Eca) a +5: (1.17) 
This equation will be derived in Sect. 1.11. Now (1.16), hence (1.14) is not 
satisfied. From equations (1.13) and (1.17) it follows that the mass conserva- 
tion law becomes 

l Dp __ ldpDe _ ldlnp 


divu = — -== = -+2 = - 
we p Dt pdc Dt p de 





{(ke,a),0 +4}. (1.18) 
Since In p is a known function of e we have effectively eliminated p, just as 
in the incompressible case. 

1.5 Conservation of momentum 

Body forces and surface forces 


Newton’s law of conservation of momentum implies that the rate of change of 
momentum of a material volume equals the total force on the volume. There 
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are body forces and surface forces. A body force acts on a material particle, 
and is proportional to its mass. Let the volume of the material particle be 
dV (t) and let its density be p. Then we can write 


body force = f’ pdV (t) . (1.19) 


A surface force works on the surface of V(t) and is proportional to area. The 
surface force working on a surface element dS(t) of V(t) can be written as 


surface force = f*dS{t) . (1.20) 


Conservation of momentum 


The law of conservation of momentum applied to a material volume gives 


e puadv = | favs | fas. (1.21) 


v(t) V (t) S(t) 


By substituting ¢ = pua in the transport theorem (1.9), this can be written 
as 





Ipua 
fi E + (pPuaug) a }dV = f otav+ | fas. (1.22) 
V(t) V(t) S{t) 


It may be shown (see Aris (1962)) there exist nine quantities Tag such that 
fa = Taste , (1.23) 


where Tag is the stress tensor (more on tensors in Chap. 11) and n is the 
outward unit normal on dS. By applying Theorem 1.2.1 with ¢ replaced by 
Tag and na by ng, equation (1.22) can be rewritten as 





Ipua 
[GE + Pues) odd =f (ofk + Tas,o)dV 
v(t) v(t) 


Since this holds for every V(¢), we must have 


Ipua 
öt 
which is the momentum conservation law. The left-hand side is called the 


inertia term, because it comes from the inertia of the mass of fluid contained 
in V(t) in equation (1.21). 


+ (pets),a = Taga + fe » (1.24) 





An example where f° Æ 0 is stratified flow under the influence of gravity. 
This will be discussed in Sect. 1.14. 
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Constitutive relation 


In order to complete the system of equations it is necessary to relate the 
stress tensor to the motion of the fluid. Such a relation is called a constitutive 
relation. A discussion of constitutive relations would lead us too far. The 
simplest constitutive relation is (see Batchelor (1967)) 


l 
Tag = —påaf + 2u(€as -7 3 Abas) ’ (1.25) 


where p is the pressure, da, is the Kronecker delta, yz is the dynamic viscosity 
coefficient, egg is the rate of strain tensor, defined by 


1 
Cag = 5 (Yag + Usa); 
and 
A= €aq = divu. 


In many fluids u depends on temperature, but not on pressure. Fluids satis- 
fying (1.25) are called Newtonian fluids. Examples are gases and liquids such 
as water and mercury. Examples of non-Newtonian fluids are polymers and 
blood. Rheology is the discipline that studies the motion of non-Newtonian 
fluids. We will not discuss rheological flows. 


The Navier-Stokes equations 


Substitution of (1.25) in (1.24) gives 


Ua 1 
g T (Pttatts),6 = -P,a + 2{u(Cap — 3Adaa)},a + pfa- (1.26) 


These are the Navier-Stokes equations. Because of the continuity equation 
(1.13), one may also write 


Dux 
Di 





1 
p = -P,a + 2tu(eas — z Afan)} g + pfa- (1.27) 


Still another form may be obtained by using the identity (which may be 
verified by inspection) 


1 
Ugta,p = zl pup),a 7 (u x Wa 3 (1.28) 
where the vorticity w is defined by 


w = curle. 
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Substitution of (1.28) in (1.27) gives 


du. 1 l 
p-r T 3P te) a plu x w)a = -p,a + 2{u(eap = 3Adas) ta + pfa- 


(1.29) 


In incompressible flows A = 0. If, furthermore, u = constant then we can use 
Us,as = Us, Ba = 0 to obtain 


Dug 
p Di = —P a + Hua pp + pf? . (1.30) 





This equation was first derived by Navier (1823), Poisson (1831), de Saint- 
Venant (1843) and Stokes (1845). 


Making the equations dimensionless 


In fluid dynamics there are exactly four independent physical units: those 
of length, velocity, mass and temperature, to be denoted by L,U,M and 
T,, respectively. From these all other units can be and should be derived in 
order to avoid the introduction of superfluous coefficients in the equations. 
For instance, the appropriate unit of time is L/U; the unit of force F follows 
from Newton’s law as MU?/L. Often it is useful not to choose these units 
arbitrarily, but to derive them from the problem at hand, and to make the 
equations dimensionless. This leads to the identification of the dimensionless 
parameters that govern a flow problem. An example follows. 


The Reynolds number 


Let L and U be typical length and velocity scales for a given flow problem, 
and take these as units of length and velocity. The unit of mass is chosen as 
M = p, L? with p, a suitable value for the density, for example the density in 
the flow at upstream infinity, or the density of the fluid at rest. Dimensionless 
variables are denoted by a prime: 


2 = z/L, u =u/U, p = p/ pr . (1.31) 
In dimensionless variables, equation (1.26) takes the following form: 


L Opu, fro _ 1 2 1 l L: b 
y g Tt (Mats). = Fae t UL teas ~ gAdaa)}.e + TP Sa » 


(1.32) 





where now the subscript ,a stands for a/dz,,, and Cag = (ul, at Us ahs 


A =e a. We introduce further dimensionless quantities as follows: 


1.5 Conservation of momentum 17 


 =Ut/L, p =p/pU?, (Ff) = SF. (1.33) 


~ U? 


By substitution in (1.32) we obtain the following dimensionless form of the 
Navier-Stokes equations, deleting the primes: 








pua _ 1 
ETE + (puaug) g = -P,a + {Re (eag — Abas) ha + £2 
where the Reynolds number Re is defined by 
Re = PUL. 
H 
The dimensionless form of (1.30) is 
Dua _ 
> = -P a + Re tua gg + fe. (1.34) 


The transformation (1.31) shows that the inertia term is of order p,U?/L 
and the viscous term is of order pU/L*. Hence, Re is measure of the ratio 
of inertial and viscous forces in the flow. This can also be seen immediately 
from equation (1.34). For Re >> 1 inertia dominates, for Re < 1 friction (the 
viscous term) dominates. Both are balanced by the pressure gradient. 


In the case of constant density, equations (1.34) and (1.15) form a com- 
plete system of four equations with four unknowns. The solution depends 
on the single dimensionless parameter Re only. What values does Re have 
in practice? At a temperature of 15°C and atmospheric pressure, for air 
we have y/p = 1.5* 107° m?/s, whereas for water p/p = 1.1» 10-® m?/s. 
In the International Civil Aviation Organization Standard Atmosphere, 
p/p = 4.9 x 107° m?/s at an altitude of 12.5 km. This gives for the flow over 
an aircraft wing in cruise condition at 12.5 km with wingcord L = 3 m and 
U = 900 km/h: Re = 1.5 x 10. In a windtunnel experiment at sea level with 
L= 0.5 m and U = 25 m/s we obtain Re = 8.3 10°. For landing aircraft at 
sea-level with L = 3 m and U = 220 km/h we obtain Re = 1.2 * 10’. Fora 
house in a light wind with L = 10 mand U = 0.5 m/s we have Re = 3.3 «10°. 
Air circulation in a room with L = 4m and U = 0.1 m/s gives Re = 2.7* 104. 
A large ship with L = 200 m and U = 7 m/s gives Re = 1.3 x 10°, whereas 
a yacht with L = 7 m and U = 3 m/s has Re = 1.9 x 10”. A small fish with 
L= 0.1 m and U = 0.2 m/s has Re = 1.8 x 104. 


All these examples have in common that Re >> 1, which is indeed almost 
the rule in flows of industrial and environmental interest. One might think 
that flows around a given shape will be quite similar for different values of 
Re, as long as Re >> 1, but nothing is farther from the truth. At Re = 107 
a flow may be significantly different from the flow at Re = 10°, in the same 
geometry. This strong dependence on Re complicates predictions based on 
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scaled down experiments. Therefore computational fluid dynamics plays an 
important role in extrapolation to full scale. The rich variety of solutions of 
(1.34) that evolves as Re -+ œ is one of the most surprising and intesting 
features of fluid dynamics, with important consequences for technological] ap- 
plications. A ‘route to chaos’ develops as Re -+ ov, resulting in turbulence. 
Intricate and intriguing flow patterns occur, accurately rendered in master- 
ful drawings by Leonardo da Vinci, and photographically recorded in Hinze 
(1975), Nakayama and Woods (1988), Van Dyke (1982) and Hirsch (1988). 
We return briefly to the subject of turbulence in Sect. 1.13. 


The complexity of flows used to be thought surprising, since the physics 
underlying the governing equations is simply conservation of mass and mo- 
mentum. Since about 1960, however, it is known that the sweeping general- 
izations about determinism of Newtonian mechanics made by many scientists 
(notably Laplace) in the nineteenth century were wrong. Even simple classic 
nonlinear dynamical systems often exibit a complicated seemingly random 
behavior, with such a sensitivity to initial conditions, that their long-term 
behavior cannot be predicted in detail. For a discussion of the modern view 
on (un-)predictability in Newtonian mechanics, see Lighthill (1986). 


The Stokes equations 


For Re | 0 the system (1.34) simplifies to the Stokes equations. If we multiply 
(1.34) by Re and let Re | 0, the pressure drops out, which cannot be correct, 
since we would have four equations (Stokes and mass conservation) for three 
unknowns ua. It follows that p = O(Re~'). We therefore substitute 


p=Re 1p. (1.35) 


From (1.35) and (1.33) it follows that the dimensional (physical) pressure is 
pUp /L. Substitution of (1.35) in (1.34), multiplying by Re and letting Re | 0 
gives the Stokes equations: 


Ua AA — Pa = 9. (1.36) 


These linear equations together with (1.15) were solved by Stokes (1851) for 
flow around a sphere. 


The Stokes paradox 


One might think that (1.36) governs flows with Re < 1 (very slow or very 
viscous flows), but Stokes (1851) shows that (1.36) does not have a solution 
for unbounded flow around an infinite cylinder, with uniform flow at infin- 
ity. Since such a flow seems physically acceptable, there must be something 
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lacking in the mathematical model (1.36). In fact, in two dimensions the 
Stokes equations do not in general provide a good low Re approximation to 
the Navier-Stokes equations, that is uniformly valid everywhere in the flow. 
This surprising fact later came to be called the Stokes paradoz. An impor- 
tant contribution to the understanding of the situation was made by Oseen 
(1910). 


The Oseen equations 


Suppose that the flow deviates only little from a uniform flow, as for exam- 
ple far from a body placed in an unbounded uniform flow. Let the flow be 
governed by (1.34). We write 


ui = |+ Evi, Ua =Eva, ar>l, e<l. 


This we substitute in (1.34) with f° = 0. We introduce p = Ep , take £ | 0, 
delete the prime and obtain the Oseen equations: 


Ya, = —P,a + Reo vapp | 
proposed by Oseen (1910). 


The Oseen equations not only describe almost uniform flows, but also slow 
(Re < 1) flows (if one replaces v again by u), in any number of dimensions. 


The study of the limit Re | 0 of the Navier-Stokes equation is far from trivial, 
and has given an important impetus to singular perturbation theory. See Van 
Dyke (1975) for more information about the asymptotic behavior of Navier- 
Stokes solutions as Re | 0. Singular perturbation theory will be introduced 
in Chap. 2, and will be found to provide useful a priori information about 
the nature of solutions to flow problems, also for Re > 1. 


1.6 Conservation of energy 
The first law of thermodynamics 


The first law of thermodynamics says that work done on a closed system plus 
heat added to it equals the increase of the sum of kinetic and internal energy of 
the system. The kinetic energy per unit mass of a material particle is tugta. 
Its internal energy per unit mass is denoted by e. This is a state variable, like 
p and p. According to thermodynamics, for simple systems there are at most 
two independent state variables, on which the others depend algebraically. 
We will restrict ourselves to simple systems. In principle one may select any 
two of the state variables as independent variables. 
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The energy equation 


By applying the first law of thermodynamics to a material volume V(t) we 
find 


S f pbv=w+a, (1.37) 
V(t) 


with E the total energy per unit mass, given by 
1 
E=e+ pla tha . 


Furthermore, W is the rate of work expended by the surroundings on the 
fluid in V(t), and Q is the rate of heat addition. The body force (1.19) does 
work at a rate u - f?pdV (t), and the surface force (1.20) does work at a rate 
u- f*dS(t), so that 


W = f Uaf? pdV + J UgfidS . 
V(t) S(t) 
Using (1.23) and Theorem 1.2.1 this can be written as 
W= f {puafe + (Wares), a}4V , (1.38) 
V(t) 


Assuming that heat is added to each material particle at a rate q per unit of 
mass, and that there is a heat flux o per unit of area through S(t), we find 


Q= J pqdV + J ads . 
V(t) S(t) 
Let heat diffusion be governed by Fourier’s law: 
c =kn-gradT , 


with k the thermal conductivity, and T the temperature, which is another 
state variable. Using the divergence theorem we find 


Q= f {01+ Ta) a}aV. (1.39) 
V(t) 


By application of the transport theorem 1.3.1 to (1.37) and substitution of 
(1.38) and (1.39) we obtain 
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OpkE 
J { i + (pua E) a }dV = J {(UaTas),p + (KT a),a + puaf? + pq}dV ; 
V(t) v(t) 


Since this holds for every V(t), we have 


OpE 
a + (Pua E) u = (UaTag) o + (kT.c),a + Plate + p4. (1.40) 


This is the energy equation. 
It is useful to put the energy equation in another form. By using the mass 
conservation equation, the energy equation can be written as follows: 
DE 
P Dt 


Similarly, the momentum equation (1.24) can be written as 


= (UaTaa),a + (kTia) a + plate + pa - (1.41) 


u 
P = Tap a + pÍ? - 


Multiplication by ua and summation gives 
ae = b 1.42) 
Dy 2ta) = UaTap, p + Puafa - (1. 
Subtraction of (1.42) from (1.41) and substitution of (1.25) gives 
De 
Dt 
Because €ag = €ga we have ta geag = Caplas: Furthermore, 


1 1 1 
ECapEaf — z4 = (eag — 3 Ada) (eas — 3 Abas) ; 


= —pA — 2pA? + 2pua,peag + (kT a),a + P4 - 


since Jagda3 = 3. Thus we obtain the following form of the energy equation : 


e 
PTE =-på +4% + (kT a), a + pq ’ (1.43) 
where 


$ = 2p (Eas — + Abas) (eap — Adag) . 


The terms in this equation have the following physical interpretation. The 
left-hand side represents transport of internal energy by convection. The first 
term at the right is work done by pressure in a (if departures from thermody- 
namic equilibrium are neglected) reversible process (the thermodynamic con- 
cepts of reversibility and irreversibility will be discussed in the next section). 
The second term % is non-negative, and represents irreversible generation 
of heat by dissipation of mechanical energy by frictional heating. The third 
term represents transfer of heat by diffusion. The fourth term represents heat 
addition or withdrawal by external sources. 
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1.7 Thermodynamic aspects 
Equation of state 


We have obtained a total of five conservation equations, namely one for mass, 
one for momentum in each coordinate direction, and one for energy. There 
are seven unknowns: p, Ua (a = 1,2,3), p, T and e. The system of equations 
is completed by equations of state. As said before, for simple thermodynamic 
systems there are at most two independent state variables, on which the 
other state variables depend. Which state variables can best be selected to 
be independent depends on circumstances. We present two examples: the 
compressible flow of a perfect gas and incompressible flow with heat transfer. 


Perfect gas 


A gas is called perfect if the following equation of state holds: 
p= pRT 3 (1.44) 


with T the temperature measured in K (Kelvin). The Kelvin scale is related to 
the centigrade seale by a shift: 0°C = 273.16 K. When expressed in Kelvin, 
T is called the absolute temperature. The parameter R is a characteristic 
constant for a particular gas. 


According to thermodynamics, for a perfect gas the internal energy e depends 
on T only, which gives us a second equation of state: 


e=e(T). (1.45) 


We now have seven equations: five conservation laws and two equations of 
state. 


Because the conservation laws contain time-evolution equations for p and e, 
it is best to take these as our two independent state variables. Because of 
the explicit occurence of T in (1.44) and the simplicity of relation (1.45) it 
may be a little more convenient to use T instead of e. Given p and T, the 
other state variables p and e follow from the two equations of state (1.44) 
and (1.45). 


Enthalpy and entropy 


For later use we introduce here two more state variables, namely enthalpy 
and entropy. The enthalpy h is defined by 


h=e+p/p. (1.46) 
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For a perfect gas we have 
h =e(T) + RT = A(T). (1.47) 
A law governing h in flows will be presented in the next section. 


We define 
Cy = =, Gp E (1.48) 


where c, is called the specific heat at constant volume and cp is called the 
specific heat at constant pressure. From (1.47) it follows that 


R=c¢p—Cy- (1.49) 


A gas is called calorically perfect if c, = constant. From (1.49) it follows that 
then also cp = constant. 


The entropy per unit mass s is defined by 
Tôs = de + pd(1/p) , (1.50) 


where 6 indicates small changes in the state variables. This can be rewritten 
as follows. From (1.46) it follows that 


de = h — (p/p) . 


Substitution in (1.50) gives 
1 
Tés = 5h — dp. (1.51) 
p 


A reversible thermodynamic process is a process in which heat is transformed 
into work or vice-versa in such a way that the process can be reversed without 
loss. An adiabatic thermodynamic process is a process in which no heat is 
transferred to or from the system. Therefore, in an adiabatic flow q = 0 in the 
energy equation, and solid walls (if any) are thermally insulated. The second 
law of thermodynamics says that in a closed system, i.e. an adiabatic system 
that exchanges no work with its environment, the entropy of the system does 
not decrease. It may be shown that only if the process is also reversible the 
entropy mains constant. 


Law for entropy 
A law governing the variation of s in the flow of a perfect gas can be ob- 


tained as follows. Application of (1.50) to a material particle gives the Gibbs 
equation: 
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Ds De p Dp 


Substitution of (1.43) and (1.13) gives 
1 1 
T= Gt ETa) a +q. (1.52) 


This is another form of the energy equation. We see that frictional heating 
increases the entropy, since y > 0. The second and third term of the right- 
hand side present interaction of the material particle with the environment, 
so that the material particle is not a closed system; accordingly, these terms 
may have either sign. In adiabatic inviscid and non-heat-conducting flow 
(q = p = k = 0) we find Ds/ Dt = 0, in which case the flow is called isentropic. 
Note that although s is constant for each material particle, different particles 
may have different entropy. If all material particles have the same constant 
entropy, the flow is called homentropic. 


Homentropic flow 


In homentropic flow, s = sọ = constant, so that there is only one independent 
state variable left, and the equations of state simplify. Take for example s and 
p as independent state variables. From equation (1.50) it follows that for a 
perfect gas infinitesimal changes of state variables satisfy 


T 
0 = bs =~ Tb. (1.53) 


For a perfect gas equation (1.44) gives 
p = RTdp+ pR6T . 
By elimination of T with (1.53) we obtain 
óp = RT(1 + R/cy)dp. 
By using (1.44) to eliminate T and (1.49) this can be written as 
op _ oP 


p po 


with y = c,/c, the ratio of specific heats (y = 7/5 for air). From this we 
obtain the following equation of state for homentropic flow of a perfect gas: 


p/p’ = po/p, = constant . (1.54) 


In isentropic flow (1.54) holds along particle paths, with the constant differ- 
ing per particle path. 


In Chap. 10 we will go deeper into the role of the entropy. For more infor- 
mation on the thermodynamics of flows, see Liepmann and Roshko (1957). 
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Low subsonic flow with heat transfer 


Consider a flow with a non-uniform temperature distribution. The density of 
a fluid particle may change due to heat conduction. The speed of the flow is 
assumed to be small with respect to the speed of sound; such a flow is called 
low subsonic. As will be seen in Sect. 1.12, this implies that the pressure vari- 
ations in the flow are so small, that their effect on the other thermodynamic 
quantities is negligible. Therefore we use p as one independent variable; T 
will be the second. Equation (1.51) gives the variation of s with p and T. For 
ðs/ðT we get 


ðs _ Oh _ 
ôT oT? 


Substitution in the energy equation (1.52) gives, assuming 0s/Op = 0 for the 
reason given above, 


DT 1 1 
om = pet p Taha +q. (1.55) 


This is the equation of heat transfer for low subsonic flow. If the viscosity is 
small, i.e. the Reynolds number is large, the frictional heating term » may 
be neglected. 


Barotropic flow 


A flow is called barotropic if there exists a function w such that for small 
variations the following relation holds: 


1 
—dp=dw. (1.56) 
p 

This occurs, for example, if p = po = constant, in which case 


w = p/po , 


or if pressure is a function of density only, i.e. we have an equation of state 


p= plp) . 


This is the case if a state variable # is constant. Taking 8 and p as our two 
independent state variables, we have an equation of state p(9, p) = constant, 


p 
so that w = f 1 dp. An example is isothermal flow, i.e. flow in which 


Po 
T = To = constant, of a perfect gas. Taking 6 = T, the equation of state 
(1.44) becomes 
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p= pRIp 
and we obtain 
w= RTo\n(p/po) , 


from which it follows that indeed 
1 1 
Wa = — RTop, a = Pea. 
p p 


A more important practical example is homentropic flow of perfect gas with 
s = sọ = constant. Then we have the equation of state (1.54), from which it 
follows that 


y P 
w= —-. 1.57 
y—Ip (1.57) 
If the gas is calorically perfect we have for the enthalpy 
yY P 


h=eT=—— ?-= 
P yp 


7 


but also if the gas is not calorically perfect we have that homentropic flow of 
a perfect gas is barotropic, with w = A: 
1 
-Pa = ha. (1.58) 
p 
This can be seen as follows. Writing po/pà = ¢ and using (1.54), we obtain 


c _ 1 
ha = Ola = PE), a = pe) a = (0/P) pra = SP. 


R 
(1.59) 


Exercise 1.7.1. Show that (1.57) implies (1.56) for homentropic flow. 


1.8 Bernoulli’s theorem 
The energy equation for an ideal fluid 


An ideal fluid is a fluid in which there is no friction and no heat conduction, 
i.e. 


=k=0. (1.60) 
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For Re > 1 this often is a good approximation. With assumption (1.60) the 
energy equation (1.40) simplifies as follows. We have 


(Ua Tap), g = —(PUa).a - 
Substitution in (1.40) with k = 0 gives 


OpE 
a + {ua(PE + P)},a = plas + 9 - (1.61) 
By using the definition (1.46) of enthalpy this can be rewritten as 
OpE 
“Ft (PuaH),a = Plaka + pa (1.62) 


where H = h+ tuata is the total enthalpy. This is the energy equation for 
an ideal fluid. 


Another useful form of the energy equation for an ideal fluid is obtained by 
rewriting (1.61) as 
DE 
p-p; + YaP,a + Ptaa = Pua fa + P4 - (1.63) 
With (1.13) we find 
1 Dp 
aa = -2 , 1.64 


so that (1.63) can be rewritten as 


or, recalling (1.46), 
L P ua , L. 


where H=h+ Fuata is the total enthalpy. 


Bernoulli’s theorem 


Let us assume that the external force field f° is conservative. That is, there 
exists a potential @ such that 


f? = -gradë . 


Such a © exists if curlf? = 0. Assume furthermore that the flow is adia- 
batic, i.e. there are no heat sources: q = 0, and that the flow is stationary, 
i.e. Op/Ot = 0. Then from (1.65) the following theorem follows, derived by 
Danie! Bernoulli in 1738: 
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Theorem 1.8.1. (Bernoulli’s theorem) 
For stationary flow of an ideal fluid without heat sources in the presence of 
a conservative force field we have 

D p 

—(H +8)=0, H=E+-. 
According to Bernoulli’s theorem, H + ® constant along particle paths in 
stationary flows, and hence also along streamlines. It often happens that all 
streamlines originate in a region where H+ = constant. Then every stream- 
line carries the same value of H +@, so that H+ is constant throughout the 
flow. This may lead to considerable simplification of the governing equations. 


1.9 Kelvin’s circulation theorem and potential flow 
Rate of change of circulation 


Consider the circulation ['(C(t)) arround a closed material curve C(t) as 
discussed in Sect. 1.3: 


According to (1.11), 


dt Dt 
C(t) 
We substitute (1.27) and obtain 
dP(C(t)) _ $ 1 b 
di = (5788 + fodra . (1.66) 
C(t) 


Inviscid barotropic flow 


For the circulation in an inviscid barotropic flow in a conservative force field 
we have the following conservation law, derived by Lord Kelvin in 1869: 


Theorem 1.9.1. (Kelvin’s circulation theorem) 
In inviscid barotropic flow in a conservative force field the circulation around 
closed material curves is constant. 
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Proof. The theorem follows immediately from (1.66): 


a $ (p,a + f°)dz, =— f (w+) dz, =0. 


C(t) C(t) 


Irrotational flow 


Let S(t) be an open material surface (i.e. a surface consisting of material 
particles) bounded by a reducible closed material curve C(t) (a reducible 
curve is a curve which can be shrunk to a point by continuous deformation 
without leaving the flow domain). Then according to Stokes’s theorem 1.2.3 
we have 


with w the vorticity, defined as 
w=curlu. 


If w = 0 the flow is called irrotational. If the flow is irrotational at t = Q and 
if Kelvin’s circulation theorem applies, then 


0=r(ci) = | f w-nas 
S(t) 


for every C(t) and S(t), so that w(t, x) = 0. Hence we may assert that an in- 
viscid barotropic flow in a conservative force field that is initially irrotational 
remains irrotational. Similarly, if C(¢) was in an irrotational part of the flow 
domain at some previous time to then F'(C(t)) = 0 (if Kelvin’s circulation 
theorem applies). Hence we may assert that along particle lines originating 
in an irrotational part of the flow we have w = 0. 


Incompressible potential flow 


If w = 0 then we have (1.1): 
u = grady , 


with y the velocity potential. If the flow is incompressible then (1.15) gives 
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Yaa=0. 


This is the potential equation for incompressible flow. It is simply Laplace’s 
equation. 


Compressible potential flow 


Let the flow be compressible and irrotational. Assume that the flow is in- 
viscid and barotropic and that the force field is conservative field (otherwise 
the assumption of irrotationality would be highly unlikely). The momentum 
equation in the form (1.29) gives 





PPa l 
at +(59,67,6 +w+),q =0. 
It follows that 
ð 1 
z tzu u+w+®= Ho, (1.67) 


with Ho a function of time, that follows from the boundary conditions. For 
example, for potential flow about an oscillating wing without body forces 
with uniform velocity of magnitude U at infinity, Ho = U 2 + Woo- 


The mass conservation equation for potential flow is 


ay +t (PP a)a =0. (1.68) 


Equations (1.67) and (1.68) contain two unknowns y and p. The most im- 
portant example of inviscid barotropic flow is homentropic flow. For a perfect 
gas we have (1.59), so that w = h. Assuming the gas to be calorically perfect, 
TPL I Pih 


— P yr? 


yip y—-Ipi? Pr 


} 


with subscript r referring to some reference state. We obtain with (1.67): 


p h _ 1 ð _ 
p = Gg) OY = {Ho ~ 8 — guu g/h O. (1.69) 


The system now consists of (1.68) and (1.69). 


Specializing further to steady flow without body forces, equation (1.67) be- 
comes 


1 
ztu th= Ho, (1.70) 


1.9 Kelvin’s circulation theorem and potential flow 31 


so that Ho equals the value of h in points where u = 0, hence Ho is the 
stagnation enthalpy. We have recovered Bernoulli’s theorem. With stagnation 
as reference state, indicated by subscript zero, equation (1.69) becomes 


1 
Z = (1= gu u/ Ho) Om" (1.71) 


In the stationary case, equation (1.68) becomes 


(P¥,a),a=0. (1.72) 


The system of equations now is (1.72) and (1.71). The reference density is 
found as follows. We have 


YPo 
=m: 1.73 
(y — 1)Ho l ) 
From (1.54) it follows that 
Po = Po (Po/Pæ) - (1.74) 


Let conditions at infinity be given. Combination of (1.73) and (1.74) results 
in 
(QPB ia- 


1.75 
YPoo l ) 


po = 
This fully specifies the equations for potential flow. Equation (1.72) with p 
given by (1.71) and (1.75) is called the full potential equation. The adjective 
full refers to the fact that, apart from the assumptions required for potential 
flow, no further simplifications have been made. 


It is instructive to rewrite the equation for p in the following manner. As wiH 
be shown in Sect. 1.12, the local speed of sound a is given by 


dp_ pP 
7 qo 15 = (YA 
p P 
where we have used (1.54). Hence 
Ho = laut P= lauta? /(y- 1). 
2 y—lp 2 
The Mach number M is defined as 
M = |ul/a. 


Hence, equation (1.71) can be written as 


p Y= 1 yyay--1) 
— = (1 + ——M Y , 
po 2 ) 
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In Sect. 13.3 of Hirsch (1990) it is shown that (1.72) is elliptic for M < 1 and 
hyperbolic for M > 1. 


The field of computing potential flows is mature; see Hirsch (1990) Chap. 13- 
15 for a survey. In this book we will not pay further attention to the potential 
flow model. Potential flow codes are in daily use for quickly computing flows 
around aircraft and in flow machinery in the predesign phase. When coupled 
with boundary layer computations (also not discussed here), accurate pre- 
dictions of lift and drag are possible when flow separation does not occur. 
But in the presence of separation or strong shock waves the more complete 
Euler or Navier-Stokes equations must be used. These form the main subject 
matter of this book. 


1.10 The Euler equations 


Flow of ideal fluids is governed by the Euler equations. We have the mass 
conservation equation (1.13), the momentum equations (1.26) with p = 0 
and the energy equation for an ideal fluid (1.62): 





Op _ 
E+ (pala = 0, (1.76) 
Optta 

eS + (ptatis),a + P,a = Pha » (1.77) 
OpE b 
> t (Pua H), a = plat, + P4 - (1.78) 


We have five equations and seven unknowns: p, ua, p, e and h. The system 
is completed by the equations of state, discussed in Sect. 1.7. For a perfect 
gas the equations of state are (1.44), (1.45). Elimination of T gives a relation 


P= plp,e} , 
and the definition (1.46) of the enthalpy A completes the system of equations: 


h=e+p/p. 


Homentropic and barotropic flow of ideal fluids 


There are various cases in which the Euler equations simplify, because the 
energy equation (1.78) can be replaced by an algebraic relation. An example 
is homentropic flow. As shown in Sect. 1.7, we have 


Dp/ _ 
ap =O. (1.79) 
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We now have an equation too many. The point is that the energy equation 
is satisfied automatically. This may be seen by using the alternative form of 
the energy equation (1.43) with u = k = 0: 


— + —Ua,a = gq . (1.80) 
p 








De Dinp 
=" _ RT = 1.81 
Dt Dt (1.81) 
Substitution of (1.48) gives 
DT Dinp 
v= — RT =q, 1.82 
epr D =4 (1.82) 
and with (1.49) we obtain 
D1n(Tp'”) _ 7 (1.83) 





Dt Cyl 


Along a particle path p/p’ = constant, hence RT p'~7 = constant, and the 
left-hand side of (1.83) vanishes. Hence we have a paradox, unless q vanishes. 
This is to be expected, since the assumption of homentropic flow implies that 
no heat is to be added or subtracted. Assuming q = 0, the energy equation is 
satisfied. It is replaced by (1.79). If all particle paths emanate from a reservoir 
with constant entropy, the flow is isentropic, and equation (1.79) reduces to 
an algebraic relation: 


p/P = po/pg =C (1.84) 


where the subseript zero refers to the reservoir conditions. By using (1.84), 
equation (1.77) can be rewritten as 


Opie 
at 


from which we will later draw an analogy with the shallow-water equations. 


+ (Puaug) a +7Cp’* po = pfa » (1.85) 





1.11 The convection-diffusion equation 
Conservation law for material properties 
Let p be a material property, i.e. a scalar that corresponds to a physical 


property of material particles, such as heat or concentration of a solute. 
Assume that y is conserved and can change only through exchange between 
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material particles or through external sources. Let p be defined per unit of 
mass. Then the conservation law for ¢ is: 


$ | av= | g-nds+ f aav. 


V(t) S(t) Vit) 


Here f is the flux vector, governing the rate of transfer through the surface, 
and q is the source term. For f we assume Fick’s law (called Fourier’s law if 
y is temperature): 


f=kgrady, 


with & the diffusion coefficient. By arguments that are now familiar it follows 
that 


a 
-EL + div(ppu) = (Rg,a),a +4 (1.86) 


This is the convection-diffusion equation. The left-hand side represents trans- 
port of » by convection with the flow, the first term at the right represents 
transport by diffusion. The balance between convection and diffusion is char- 
acterized by the Péclet number Pe, defined as 


Pe = UL/(poko) 5 


where L is a typical length scale of the flow, and U, po and ko are typical 
magnitudes of |u|, p and k. For Pe >> 1 we have dominating convection, for 
Pe < 1 diffusion dominates. 


By using the mass conservation law, equation (1.86) can be written as 


pe = (kPa) +q. (1.87) 
The convection-diffusion equation governs the concentration distribution of 
solutes. Note that we have already encountered several times equations that 
closely resemble the convection-diffusion equation, such as the energy equa- 
tion. Since De/Dt = c,DT/Dt, equation (1.43) is a convection-diffusion 
equation for T with special source terms. Also the momentum equation (1.30) 
comes close to being a convection-diffusion equation. Many aspects of nu- 
merical approximation in computational fluid dynamics already show up in 
the numerical analysis of the relatively simple convection-diffusion equation, 
which is why we will devote two special chapters to this equation. 


1.12 Conditions for incompressible flow 


Equations of motion for isentropic flow 


Suppose the motion of the fluid consists of small isentropic oscillations around 
a state of static equilibrium in which u = Q, in the absence of body forces. 
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Taking p and s as our independent state variables, there is an equation of 
state p = p(p,s). For isentropic flow the mass conservation equation (1.13) 
can be written as 


1 Dp . 
zz py 7 dive =0, (1.88) 
defining 
Op 
2 — aiir 
a = (p , 


where the subscript s indicates that the entropy s is kept constant. Note that 
(Op/Gp), > 0, because increasing p (compression) gives increasing p. For a 
perfect gas we have the equations of state (1.44) and (1.54), from which it 
follows that 


a = ye =7RT. 
p 


Because p = 0 in isentropic flow, the momentum equation (1.27) becomes 
Du 
P Dt 


Equations (1.88) and (1.89) are the equations of motion for isentropic flow. 


= —gradp . (1.89) 


The speed of sound 


Let the equilibrium state be denoted by subscript zero. From (1.89) it fol- 
lows that po = constant and from the equation of state (1.54) it follows that 
po = constant. Linearizing equations (1.88) and (1.89) around the equilib- 
rium state we obtain, writing p = po + pi, 


1 Op, . 
ae Ot + podivu = 0, 
Ou 
Poa = —gradp . (1.90) 


By taking the divergence of (1.90) and eliminating u we obtain 


+ 5 = (p) 
a2 O42 = (Pi), œa 


This is the wave equation governing acoustics. In one space dimension the 
general solution is, as can be seen by inspection, 


pi = flaot + z1) + glaot — 21) , 


with f and g arbitrary; f and g present left- and right-traveling waves with 
phase velocity ag, so that we see that ao is the speed of sound. 
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Conditions for incompressibility 


Let L and U be characteristic length and velocity scales of the flow. This 
means that flow properties vary only slightly over distances small compared 
to L, and that variations of u have a typical magnitude U. For example, 
for flow around a sphere U can be taken the velocity at infinity. For low 
Reynolds numbers a suitable choice for L is the diameter D of the sphere, 
but at large Reynolds numbers a correct choice for L is a typical boundary 
layer thickness, leading to L = DRe71/?, 


The flow may be regarded as incompressible if the density of a particle does 
not vary much: 


Using the equation of state p = p(p, s), we can write 


Op, Ds 


1Dp _ 1 Dp 
as’? Dt ` 


Sipe agile 


pDt pa? Dt 


It can be shown (cf. Batchelor (1967)), using (1.52), that the second term 
of the right-hand side contributes significantly only under unlikely circum- 
stances, such as gas flow with LU = 107! cm?/s and T = 100°C. Retaining 
only the first term, the condition for incompressibility becomes 


We now assume isentropic flow, because normally the influence of viscosity 
and heat conductivity is not such as to have a decisive effect on the order of 
magnitude of pressure variations. Hence we can use (1.89) to obtain 


Dp _ Op 
D ðt 


4 u- gradp = Op 1 Du-u 

a Bt 2° Di 
Since in general there is no reason why these two terms should balance, we 
obtain the following two conditions for incompressibility: 


1 ðp U 
> a — 1.91 
R e K F’ (1.91) 

1 Du-u U 
2 Di < T: (1.92) 








Let T ba a typical time scale of the flow. The magnitude of pressure variation 
is pU L/T, as can be seen from (1.89), hence saz Op/ Ot is of order UL/(ar)?, 
so that (1.91) becomes 
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2 
<1. (1.93) 





a?r? 


In acoustics L is the wavelength of a sound wave, i.e. L = ar, and (1.93) is 
not satisfied. We are not surprised to see that in acoustics we cannot assume 
incompressibility. 


Next, consider (1.92). We have D(u-u)/Dt = O(u-u)/dt+ uað(u  u)/ðza. 
The order of magnitude of these term is U?/r and U?/L respectively. Hence, 
(1.92) gives the following two conditions: 


UL 
ar <1, (1.94) 
U? 
qe <1. (1.95) 


Condition (1.94) is not independent, but follows from (1.93) and (1.95) (which 
give (UL/a’r)* < 1). 


In conclusion, the conditions for incompressibility can be written as 
l/r <a/L, (1.96) 
and 
M<l, (1.97) 


where M = U/a is the Mach number. The first condition limits the frequency 
of unsteady phenomena in the flow; the second limits the magnitude of the 
velocity of the flow. In practice, for M < 0.3 incompressibility is a good 
approximation. 


1.13 Turbulence 


As already mentioned in Sect. 1.5, it is a fact of nature that for sufficiently 
large Reynolds numbers flows show rapid apparently random fluctuations, 
even when the controlling factors of the flow, such as body geometry and 
upstream conditions, are stationary. Such flows are called turbulent. Turbu- 
lent flow is a complicated example of a chaotic dynamical system. For an 
introduction to turbulence, see Hinze (1975), Tennekes and Lumley (1982) 
or Libby (1996). Turbulence remains one of the great unsolved problems of 
physics. Accurate prediction of turbulent flows starting from first principles is 
out of the question for realistic applications in engineering, as will be shown 
below. Other fundamentally sound prediction methods have not (yet) been 
found. The difficulty is that turbulence is both nonlinear and stochastic. 
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Direct numerical simulation of turbulence 


Since turbulence is governed by the Navier-Stokes equations, turbulent flow 
can be computed from first principles by solving the Navier-Stokes equations. 
We now show that this is not feasible for general engineering computations. 


Let U and £ be the velocity and length scales of the large eddies in a turbulent 
flow, and let us define the macroscale Reynolds number by 


Ren =UL/V, 


with v the dynamic viscosity coefficient. The length scale of the smallest 
eddies is called the Kolmogorov scale, and is denoted by 7. It has been shown 
by Kolmogorov (see Tennekes and Lumley (1982)) that 


n/L = O(Re;,"*) . 


As an example, let us consider the turbulent flow in a pipe with diameter L 
and mean velocity U. The macroscale velocity U and length £ are found to 
be approximately 


C=L/10, U= Telp, 


with Tẹ the wall friction. The Blasius resistance formula (Goldstein (1965), 
Sect. 155) gives: 


U =0.2URe7 "8, Re=UL/v. 


Hence the macroscale Reynolds number is 


_ lyre 
Rem = Bp Ne . 


This gives us away from the pipe wall 
n/L & 1.9Re7?!3? | 


To resolve the Kolmogorov scale, the mesh size A of the computational grid 
must satisfy A < n. With h = n/2, we obtain for the number of grid cells in 
one direction N = L/h = Re?!/3? so that the total number of cells is 


N? = Re®/3? = Re? , 

For Re = 10°, for example, this gives us N? = 101°, which is clearly not 
sustainable on the computer infrastructures available at present and in the 
foreseeable future. In more complicated geometries and higher Reynolds num- 
bers the required number of grid cells is even much larger. But at moder- 
ate Reynolds numbers and in simple geometries direct numerical simulation 
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is a valuable tool for studying the fundamental properties of turbulence. 
Databases generated by direct numerical simulation can be used in the de- 
velopment and checking of approximate models. Such databases can be found 
at the following website: 
ercoftac.mech.surrey.ac.uk 

Some recent publications on direct numerical simulation are: Eggels, Unger, 
Weiss, Westerweel, Adrian, Friedrich, and Nieuwstadt (1994), Le, Moin, and 
Kim (1997), Moin and Madesh (1998), Na and Moin (1998), Reynolds (1990), 
Skote, Henningson, and Henkes (1998), Verstappen and Veldman (1997), 
Voke, Kleiser, and Chollet (1994). Earlier publications are quoted in Libby 
(1996). The numerical principles discussed in the present book apply to direct 
numerical simulation of turbulence. 


Large-eddy simulation of turbulence 


With large-eddy simulation, the large turbulent eddies are resolved numeri- 
cally, but, in order to diminish the demands put on computer resources, the 
small eddies are modeled heuristically. This is called subgrid-scale modeling. 
The structure of the small eddies is to a large extent independent of the 
particular geometry at hand, and largely universal. Therefore there is some 
hope of obtaining a universal model for the small eddies, but this matter is 
still not completely settled. It would lead to far here to discuss subgrid-scale 
modeling. The numerical principles discussed in the present book apply to 
large-eddy simulation of turbulence. Application of the numerical method 
presented in Sect. 13.5 is described in Manhart et al. (1998). 


For large-eddy simulation of the flow around large aircraft at Re = 107, 
Chapman (1979) has estimated a requirement of 8 - 10° grid cells and 104 
Mwords storage, assuming that large eddies are resolved only where they 
occur, namely in the boundary layer at the surface of the aircraft, and in 
the wake. A crude estimate of computing time may be obtained as follows. 
Taking as a rough guess for the number of floating-point operations per grid 
cell and per time step 500 flop, and assuming 500 time steps are required 
to gather sufficient flow statistics, we arrive at an estimate of a total of 200 
teraflop (1 teraflop = 101? flop). So this type of computation is becoming 
feasible with the teraflop machines that are making their appearance. But 
at present, large-eddy simulation is still largely in the research stage, and 
has not yet developed in an engineering tool. Some recent publications on 
large-eddy simulation are: Boersma et al. (1997), Boersma and Nieuwstadt 
(1996), Ferziger (1996), Galperin and Orszag (1993), Reynolds (1990), Voke, 
Kleiser, and Chollet (1994). Large-eddy simulation databases can be found 
in the ERCOFTAC website mentioned before. 


40 1. Basic equations of fluid dynamics 


Reynolds decomposition 


In engineering practice, turbulent flows are generally modeled by the Reynolds 
averaged Navier-Stokes (RANS) equations, in order to achieve computer time 
and memory requirements that are feasible for industrial applications. Before 
presenting the RANS equations, we discuss the Reynolds decomposition. A 
quantity A(z,t) can be decomposed into a mean value A and a fluctuation 
A as follows: 


A=A+A,, (1.98) 
defining A as 
T/2 
A(t, x) = > | A(t + 7,x)dr , (1.99) 
-T/2 


where T is a timespan that is large compared with the time scale of turbulent 
fluctuations but small compared to the time scale of other time-dependent 
features of the flow. Such a choice of T is often possible, because in fully 
developed turbulence (Re sufficiently large) the time scale of the turbulent 
fluctuations is often very small compared to the time scale of other unsteady 
features of the flow, such as the period of an oscillating body; often, the 
mean flow is stationary. Equations (1.98) and (1.99) define the Reynolds 
decomposition; A is called the Reynolds average (Reynolds (1895)). 


In order to avoid products of fluctuation of density and fluctuations of other 
quantities one also uses the density-weighted mean or Favre (1965) average 
A, defined by 

A= pA/p. 


with the corresponding fluctuation 


Note that 
A=A, A=A, A =0, A” =0, A" =A-A, pA" =0. 


It follows that 


A 
+ A" B") = pAB + pA" B” . (1.100) 
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Reynolds averaged Navier-Stokes equations 


Taking the mean of the mass conservation equation (1.13) gives 


Op 
Ot 


Next, consider the momentum equation (1.26). Applying (1.100), we have 


+ divpu =0. 


Plats = pitatig + puuta , 
Taking the mean of (1.26) gives 
Opti 
ði 


where the viscous stress og is defined by 





+ (Ptigtis),¢ = -P,a tong tong + Pfs. (1.101) 


{| — 
Tag = 2(Heap — zHAbap) » 


and o} is the Reynolds stress, defined by 


Of = Puy Ue - 
Equations (1.101) are called the Reynolds averaged Navier-Stokes equations. 
Note that the turbulent fluctuations cause additional momentum transfer 
through the Reynolds stress. The Reynolds stress is quite often much larger 
than the viscous stress. Frequently, the Reynolds stress is related to the mean 
velocity field by 


` - 1 
Tag = — Ht (to,g + Usa) + zpi ap ’ 


where p is called the eddy viscosity, still undetermined. This is called Boussi- 
nesq’s closure hypothesis. The eddy viscosity needs to be modeled, and 
Ep, Uy is subsumed under the pressure. Unfortunately, significant deviations 
from Boussinesq’s closure hypothesis are common. 


Closure problem 


In order to complete the system of equations, the Reynolds stress o% has to 
be expressed in terms of the mean quantities p, u and p. However, no general 
law for this is known. This is known as the closure problem. In practice semi- 
empirical relations are introduced, leading to so-called turbulence models. 


Many turbulence models have been proposed. The quest for adequate tur- 
bulence models has been vexed by a lack of sufficiently general and sound 
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simplifying principles. In algebraic models, Boussinesq’s closure hypothesis 
is used, with an algebraic equation to relate the eddy viscosity to the primry 
unknowns. The computing cost of algebraic turbulence models is not much 
larger than that of the laminar Navier-Stokes equations, and the numerical 
aspects do not go beyond what is covered in the present book. See Bradshaw 
(1997), Libby (1996) and Wilcox (1993) for a survey of turbulence modeling. 


In one- and two-equation models, the eddy viscosity depends on certain quan- 
tities that obey partial differential equations. These quantities are, typically, 
the turbulence kinetic energy k and the dissipation per unit turbulence ki- 
netic energy w or the turbulence dissipation £. This leads to the k — £ and 
k —w models, which are, generally speaking, more dependable than algebraic 
models, but still suffer from the weakness of Boussinesq’s closure hypothesis. 
The equations governing k,w and £ are modeled heuristically, and involve 
large source and sink terms, that require delicacy in numerical approxima- 
tions, because k,w and € must remain strictly non-negative. This involves 
numerical considerations not covered in the present book; see Mohammadi 
and Pironneau (1994), Wilcox (1993), Zijlema, Segal, and Wesseling (1995), 
Zijlema (1996), Wesseling, Zijlema, Segal, and Kassels (1997) for these as- 
pects. Computing cost is higher than for algebraic models, but significantly 
less than for the class of turbulence models discussed next. 


In Reynolds stress models, Boussinesq’s closure hypothesis is abandoned, and 
differential equations are formulated for the individual components of the 
Reynolds stress tensor. The mathematical properties of the resulting system 
of differential equations are not yet well understood, and research into numer- 
ical approximations continues. Computing cost approaches that of large-eddy 
simulation. Some publications on Reynolds stress models are Dol, Hanjalic, 
and Kenjeres (1997), Hanjalié (1994), Launder (1990), Launder (1996), Shih 
(1997). Boussinesq’s clossure hypothesis can also be replaced by something 
better in the context of two-equation models, as discussed in Launder (1996) 
and in Chap. 6 of Wilcox (1993); this keeps computing cost lower than for 
Reynolds stress models. 


Because of the inadequacies of the existing turbulence models as well as the 
very large computational complexity of advanced models, the prediction of 
turbulent flows is generally considered one of the ‘grand challenges’ of com- 
putational science. In view of our remarks on predictability of nonlinear dy- 
namical systems in Sect. 1.5, we can only hope to obtain accurate predictions 
of statistical averages of flow quantities. And even for these the time interval 
over which these predictions can be accurate is often severely limited. For 
example, the predictability horizon for the atmospheric flows that determine 
the daily weather at middle latitudes is thought to be at most a few weeks, in 
the hypothetical circumstance that the initial and boundary conditions are 
specified exactly. The butterfly on a windowsill in Rio de Janeiro that by the 
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flutter of its wings causes a tornado in Texas (Lorenz (1993)) has become 
proverbial. Aristotle’s (384 BC) admonition should be kept in mind: “It is 
the mark of an educated mind to rest satisfied with the degree of precision 
that the nature of the subject admits, and not to seek exactness when only 


an approximation is possible”.' 


1.14 Stratified flow and free convection 
Stratified fow 


Flows in which heavier and lighter layers of fluid occur are called stratified. 
If the fluid is initially at rest, but the density distribution becomes unstable 
(heavy fluid on top of light fluid), for example by heating from below, flow 
ensues. Free convection is flow driven by gravity through non-uniformity of 
the density distribution. 


Consider flow in the presence of gravity. The momentum equation (1.27) 
becomes 
Due 1 


1 
Dt = ~ phe + p PP + Ga 5 (1.102) 





with g the gravitational acceleration vector, and 


1 
Cap = 2u(€ag — 3 Abas) . 


Let there be density variations in the fluid due to non-uniformity of con- 
centration c of a solute, or due to temperature variations. Let the density 
satisfy 


p= ple). 
The transport equation for the solute is (1.87): 


De 1 1 
Di ~” p (Reva) a +5: 
The mass conservation equation is given by (1.18): 


divu = _idinp 
~ pde 





{(ke.a).a +q}. (1.103) 


These are the equations for stratified flow. If dln p/dc and variations of ¢ are 
small, then (1.103) may be approximated by divu = 0. 


! The author is indebted to Kenjere’ (1999) for this quotation. 
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Hydrostatic equilibrium 


In hydrostatic equilibrium u = 0. Equation (1.102) reduces to 


Lpa = Ja - (1.104) 

p 
This equation can be solved as follows. Consider a fluid with an equation of 
state p = p(p, s). Assume s = constant. This means that there is thermody- 
namic equilibrium and that no heat transfer takes place. Choose one axis in 
the direction opposite to g, and call the coordinate along this axis £; we may 
think of £ as height above the surface of the earth, and take £ = 0 on the 
surface. Then (1.104) becomes 


-= =-9, g=\gl- (1.105) 
This equation can be solved as follows. Denoting conditions at € = 0 by a 
subscript zero, the equation of state for a perfect gas is 
P_ Po 
py A 


By elimination of p, equation (1.105) becomes 


dp- Nit ~ trl p-iyy 
dE y 
The solution is 
p = poll — (y ~ lgé/agy/O-) , (1.106) 
where ao = 1/7Ppo/po is the speed of sound at £ = 0. If 
g€/a5 K1, (1.107) 


the solution (1.106) can be approximated by the following linear pressure 
distribution: 
p = po — pogé . (1.108) 


This is also obtained directly if we assume in (1.105) p = po. Under what 
circumstances does (1.107) hold? Suppose we study the atmosphere. We have 
g = 9.8 m/s’, and under standard circumstances at sea level aj = 314 m/s. 
Then (1.107) gives £ < 10 km. 


If the equation of state is arbitrary, we can proceed as follows. We have 


dp\ _ 
(3) ~ 
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so that (1.105) can be rewritten as 


Ginp _ 
Of 





~—g/a? ’ 


with formal solution 
i 1 
In p/po = -9 f ae. 
0 


If (1.107) is satisfied, we have approximately p = po, and (1.108) is recovered. 
In general Cartesian coordinates, equation (1.108) becomes 


P = Pot poata. (1.109) 


The Boussinesq equations 


Consider a hydrostatic equilibrium state with p = pp given by (1.109) and 
p = po. Let the hydrostatic equilibrium state be perturbed by variations of a 
variable that influences the density, for example temperature or concentration 
of a solute; in any case, denote this variable by T. The fluid may or may not 
be set in motion as a result. We have an equation of state p = p(p, T). If the 
velocity is small compared to the speed of sound, the dependence of p on p 
may be neglected, and we may write p = p(T). Let T = To be the uniform 
distribution of T for which we have p(To) = po. Let 


le foo K 1, |p l/pa K 1, (1.110) 


with p = p — po, p =p- ph. We approximate the terms in (1.102) using 
(1.110), assuming the flow is driven by gravity. Because the pressure gradient 
and gravity may balance, as in hydrostatic equilibrium, it is the difference in 
these two terms that drives the flow. These terms will therefore be approxi- 
mated together. We may write 


l ] Pp 1: 
p a & Ga poe af oo? poe ( ) 
PP Pann P_ l 
ÖO(—, (—)°par) = Ja - —P a 
Po Po o po 


Using (1.110) we see that the neglected terms are negligible compared to the 
terms kept. Furthermore, 


t 


1 1 p 1 
-0an = — (1 + Of—))oag = —oas . 1.112 
5000 = (1+ OE ))oap = —eap (1.112) 
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At first sight this approximation seems unjustified, because we neglect a term 
of Olp /po), while terms of this magnitude are kept in (1.111). However, 


because (1.111) is assumed to be the driving term, Cap, = Oe, tpa), so 


that the neglected term in (1.112) is in fact quadratic in the perturbations. 
Using the equation of state we may write 


p = —bpoT , (1.113) 


with T = T — Tọ, and 





with b the thermal expansion coefficient of the fluid. Substitution of (1.111)- 
(1.113) in (1.102) gives 


Due 1 t 1 t 
— = —-— — —GobT . 1.114 
Dt poh + po Ga ( ) 


These are the Boussinesq equations. The flow that results if the hydrostatic 
state is unstable is called free convection. 


Dimensionless Boussinesq equations 


The following units are chosen: 


density: po 

length: L (a typical length scale of the flow domain) 
velocity: U = TI (with xo a reference viscosity value) 
time: r= L/U 


temperature: AT (a typical magnitude of the variations in T that are 
imposed by the boundary conditions) 
pressure: P = pU? 


The dimensionless form of (1.114) is found to be, keeping the same symbols 
(i.e. u := u/U, p := H/Ho, Ja = JaT?/L etc.): 


Dux t ' 

p= P,a + F089 — GGaT , (1.115) 

where the dimensionless number G, called the Grashoff number, is defined by 
G = bATg L? p/p? . 


The system of equations is completed by the mass conservation equation and 
the energy equation for T . The mass conservation equation is given by (1.18). 
Using (1.110) this is approximated by 
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divu = 0. (1.116) 


The energy equation is given by (1.55) with y = 0. In dimensionless variables 
we have p = 1, and with g = 0 we obtain the following dimensionless form of 
the energy equation: 


— = Pro (kT), (1.117) 


where the dimensionless number Pr is the Prandtl number, defined by 


Pr = ko/(cppo) - 


The equations governing free convection (1.115)-(1.117) are called the Boussi- 
nesq equations. . 


1.15 Moving frame of reference 


In applications with a rotating geometry, such as helicopter blades, turbo- 
machinery or the atmosphere or seas and oceans, it is necessary to use a 
rotating coordinate system. Therefore we will present the laws that govern 
fluid motion in a moving Cartesian coordinate system. 


Assume that we have a Cartesian reference frame that is not inertial but that 
is moving. Let the instantaneous linear acceleration of the origin be a and let 
the instantaneous rotation around the origin be 92. Batchelor (1967) Sect. 3.2 
shows that the mass, momentum and energy conservation equations are iden- 
tical in form to those in an inertial frame, provided we add the fictitious body 
force 


- d 
P =-a-20 xu- E xe- Nx (Nx a) 


per unit mass to the real body force f°. This modification is to take place 
both in the momentum equation (1.26) and in the energy equation (1.40). 


The term —2§2 x u is the Coriolis acceleration. Note that u is the velocity 
relative to the moving frame. The last term can be rewritten as follows. Let 
æ =y +r with y the orthogonal projection of æ on $2: 


y = (æ N/N- MR. 
Then N x z = N x r, and r: N = 0, so that 


Nx(Nxæ)=Nx(Nxr)=(N-r)N-—(N. Nr 
=-Pr, M=N-N. 
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This is recognized as centrifugal acceleration. 


The Cartesian components of f° are: 


dQ? 
fe = da — Eagy{22guy + et} +2ro, r=e-y. 


1.16 The shallow-water equations 
Depth-averaged continuity equation 


Consider a three-dimensiona!l domain in which water flows with a free surface 
under the influence of gravity. The situation is illustrated in Fig. 1.2. The 


X3 


z 
aerer e 
X1 
Fig. 1.2. A free surface flow problem 


£3-axis is in the direction opposite to that of gravity. The water elevation 
is €(t,21,22) and the bottom topography is given by H(t, z1, £2), so that 
the water depth is d = € — H. The free surface may also be defined by 
F(t, £) = x3 — E(t, #1, £2) = 0. Since the free surface moves with the flow, a 
material particle that is at the free surface stays at the free surface (except 
in singular points), so that DF/ Dt = 0, or 


ðE ðE dE 


Bt be, t ge T (1.118) 
Similarly, 

OH OH OH 

oe + “9g, + Ge, TO (1.119) 


In the shallow-water approrimation the depth-averaged horizontal velocity 
components U; and U2 are used as unknowns instead of u1, uz: 


Ux = 


ap ee 


£ 
J vades, œ = 1,2. 
H 


1.16 The shallow-water equations 49 


The density p is assumed to be constant. Integration of the continuity equa- 
tion (1.15) over x3 gives 


ð ð 
j (a t gads + ult 1, 22,€) = ualt en ea H)=0. (1.120) 


H 


We have, summing over a = 1, 2: 





Iua ð ðH 
j ” dz £3 = > i (Ws )- tha (ty 21, 29, £) È + ualt, #1, £2, H H) 


OF a E 


By using (1.118) and (1.119) and substitution in (1.120) we obtain the depth- 
averaged continuity equation: 


Od ð ð 


where we have used 0(€ — H)/ôðt = ðd/ðt. 


Depth-averaged momentum equations 


Since many interesting applications of the (to be derived) shallow-water equa- 
tions arise in geophysics, it is of interest to take the rotation of the earth into 
account by choosing a rotating frame of reference. But the dorain is as- 
sumed to be small enough to be assumed flat, and we will continue the use 
of Cartesian coordinates. Including the effect of Coriolis and gravitational 
acceleration, according to the preceding section we have a body force 


f?=g—-2Nxu 


in the momentum equations (1.27), where 9 is the rotation vector of the 
earth. The centrifugal acceleration is negligible in geophysical applications. 


In the shallow-water approximation it is assumed that |u3| < |uj|, |u2| and 
u3 is approximated by zero. This gives, neglecting the vertical component of 
the Coriolis acceleration because it is very small compared to gravity: 


2N x u = (— fuz, fui,0), f=2|A\sing, (1.122) 


with @ the geographic latitude. With u3 = 0, the z3-component of (1.27) 
becomes: 


so that 
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p = F(x1,£2) — pgzs , 


with F still to be determined. At the free surface p equals the atmospheric 
pressure pa. This determines F, and we obtain 


p = Pa + pg(E — 23). (1.123) 
The remaining momentum equations become 
Duy _ + uu 4 fu 
P Di — Pil H 1,88 P 2, 
Du _ + pu fu 
Pe T TPT 2,88 — Pju . 


To exploit the shallow-water approximation, we integrate over the depth. We 
stipulate that the range of Latin indices is {1,2}. The summation convec- 
tion is temporarily extended to Latin indices. Because divu = 0 we have 
ugOu,;/Oxg = Ou; ug/Oxg. Integration of the inertia terms gives: 


£ £ 

Ouj ug _ ð 
| = Ir; f uyida 
H H 


OE OH 
~iu (Ue BE — u3)}lesre + {uj (Uk a — U3) }Ujles=H 


0 Gla OH 
= az, ujukdE3 + ge tiles=é — “py “i les=H . 
H 


Furthermore, 


Y 
Ou; ð 0 OH 
J Ges = =; (dU;) — S jlez + pp “i les=H . 
H 


Hence 


E E 
Du; ð O 
—dzr3 = —(dU;) + —— . 
| pi 73 = Us) t Fe, [ udes 
H H 
Let us write 
uj = U; + Uj . 


Then 


£ 
fiur = dU;Upk + Ojk,k s 
H 


1.16 The shallow-water equations 51 


with 
£ 
Ojik = J tiuaes . 
H 


Next, oj, is neglected, which is justified only if |ū;| « |U;], in other words, 
if ua depends only weakly on x3. Due to the no-slip condition, this will not 
be the case near the bottom. For more accurate modeling, depth-averaging 
would have to be done over more than one layer, or abandoned altogether, 
but we will not do this here. Proceeding with these approximations, we now 
have: 


£ 
Du; m f ð l 
J Fian = 5 (dj) + Je, UU) , 
H 


Because of (1.121) this may also be written as 


£ 
Du; ma Ou; OU; 
J spite = dG + ay Ox, . 
H 


The next term to be depth-averaged is the pressure gradient. By using (1.123) 
we obtain (assuming pa = constant) 


£ 
1 
5 | pides = gdÉ ; . 


H 


The viscous stress term is not depth-averaged. The influence of friction makes 
itself felt mainly at the bottom, but since depth-averaging implies that the 
no-slip condition is not applied, the influence of bottom friction is accounted 
for in a simplified semi-empirical way by means of a force term F}, given by 


Fj = —gU;|U|/@4d , 


with C the so-called Chézy coefficient, and |U| = (U,U,)'/? . The empirical 
parameter C depends on the bottom roughness. 


Depth-averaging of the Coriolis acceleration gives 


£ 
J fujdé = fdU; . 
H 


The depth-averaged momentum equations can now be summarized as follows: 
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Ws yy Wr y dU 4H _ py, p Wale _ 





ae tg U2 Az, de fU + Cd > (1.124) 
OU, OU? OU, ðE gUz|U | i 
a tn ten Ila Ot ea T. 


Together with (1.121) these form the shallow-water equations. 


For a more extended discussion of the mathematical modeling of flows of 
shallow water, see Vreugdenhil (1994). The main assumption for the validity 
of the shallow-water equations is that the water depth is much smaller than 
the horizontal length scales. This implies that the wavelength of surface waves 
should be much langer than the depth (more than 20 times as large, say, 
cf. Vreugdenhil (1994) p.20). A systematic derivation of the shallow-water 
equations using asymptotic expansions is given in Stoker (1957), Sect. 2.4. 


Gasdynamics analogy 
If we neglect Coriolis acceleration and bottom friction, take the bottom height 


H (z1, £2) = constant and give d the new name p, the shallow-water equations 
(1.121), (1.124) take on the following appearance, with a € {1, 2}: 





0 
S +(pPUk)k =0, (1.125) 
DU; 


By multiplying (1.125) by U; and (1.126) by p and adding we obtain 


OpU; 
Ot 





+ (pU;Uk),k + pgp, =0. (1.127) 


Equations (1.125) and (1.127) are equivalent to the isentropic Euler equations 
(1.76), (1.85), for ideal fluids with ratio of specific heats y = 2. This is called 
the gasdynamics analogy. 


2. Partial differential equations: analytic 
aspects 


2.1 Introduction 


As seen in the preceding chapter, fluid dynamics is governed by partial dif- 
ferential equations. Therefore knowledge of the numerical analysis of partial 
differential equations is indispensable in computational fluid dynamics. In- 
troductions to this subject of varying degree of difficulty are Fletcher (1988), 
Hackbusch (1986), Hall and Porsching (1990), Hirsch (1988), Lapidus and 
Pinder (1982), Mitchell and Griffiths (1994), Morton and Mayers (1994), 
Grossmann and Roos (1994), Strikwerda (1989), Quarteroni and Valli (1994), 
Richtmyer and Morton (1967); the last book in particular is useful for prac- 
titioners of computational fluid dynamics. 


Of course, in the study of the numerical aspects of partial differential equa- 
tions, their analytic aspects play an important role. Although the reader is 
assumed to be familiar with the basics of the analysis of partial differential 
equations, we will devote this chapter to this subject, in order to highlight 
a few important topics that receive less attention elsewhere, and that will 
play a role later. In particular, we will discuss maximum principles, in or- 
der to put discussions of numerical ‘wiggles’ (see Chap. 4) on a firm footing. 
Furthermore, we will treat those aspects of singular perturbation theory that 
are required for a thorough understanding of the convection-diffusion equa- 
tion. Also, we will pay more attention than usual to the treatment of mixed 
derivatives. These have often been neglected in the past, presumably because 
mixed derivatives seldom arise in mathematical physics. However, at present 
the predominant approach to handle arbitrarily shaped flow domains in the 
context of finite volume discretization is by means of boundary fitted coordi- 
nates (see Chap. 11). Because these coordinates are generally non-orthogonal, 
the governing equations frequently have mixed derivatives in the transformed 
coordinates. 


More advanced information on the analytic aspects of partial differential 
equations can be found in Courant and Hilbert (1989), Garabedian (1964), 
Hackbusch (1986), Kreiss and Lorenz (1989), Protter and Weinberger (1967). 


We will discuss in this chapter (special cases of} the following linear partial 
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differential equation: 


Lp = pi — (aap p p), a t baa tep =s, 


2,1 
p,s: Nx (0,T]>R, CRÍ, a,B=1,2,...,d. (2.1) 


where ¢ stands for time, p: for ĝp/ôt, and d is the number of space dimen- 
sions. Boundary conditions will be specified later. The coefficients agg, ba 
and c are given functions of t and æ E 92. 


2.2 Classification of partial differential equations 


Stationary case 


We start with assuming that ~ does not depend on time, so that the time- 
derivative in (2.1) is deleted. Furthermore, for the time being it is convenient 
to rewrite the equation as 


Le = —aapp ap + bapa +E ES, (2.2) 
which implies the following replacement; 
ba := ba — Aga,p » 
Since Y ag = Ppa We can assume without loss of generality that 
alap = aga (2.3) 


(if (2.3) does not hold, simply redefine agg = apa := (aap + apa)). Conse- 
quently, the matrix A with elements agg is symmetric and has real eigenval- 
ues. 


Three special types of partial differential equations are distinguished, accord- 
ing to the following definition. 


Definition 2.2.1. Classification of partial differential equations. 
Equation (2.2) is called 


(i) elliptic in æ if the eigenvalues of A are nonzero and have the same sign; 

(ii) hyperbolic in æ if the eigenvalues of A are nonzero and precisely one 
eigenvalue has sign different from all others; 

(iti) parabolic in x if precisely one eigenvalue is zero, while the others have 
the same sign and Rank(A, b) = d, with b the vector with elements ba. 


In the elliptic case the symmetric matrix A is definite. Without loss of gen- 
erality we assume in this case A to be positive definite, because if this is not 
the case we may reverse sign in (2.2). 


It can be shown that the type of a partial differential equation is invariant 
under coordinate transformation (see Hackbusch (1986) Sect. 1.2). Note that 
there are undefined cases. 
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Nonstationary case 


The nonstationary case 0y/dt + Ly = s can be subsumed under the classifi- 
cation for the stationary case Ly = s by introducing time ¢ as an additional 
coordinate z441, and extending the range of a and 8 to 1,2,...,.d+ 1. The 
result is the following change in A and b: 


where the subscript s refers to the stationary case. Note that if the stationary 
case is elliptic, then the nonstationary case is parabolic. 
Two-dimensional stationary case 


In this case we find that equation (2.2) is 
(i) elliptic in æ if 


a11022 — aj, >0, (2.4) 

(ii) hyperbolic in æ if 
a11822 — a’, < 0 ` (2.5) 

(iii) parabolic in æ if 
aiiz — a’, =(0 and Rank ( a a A ) =2, (2.6) 


The archetypes of the three cases are 


-Paa =O, (2.7) 
(Laplace’s equation, elliptic), 
-pu tpz =l, (2.8) 
(the wave equation, hyperbolic), 
Yi -¥,22=90, (2.9) 


(the diffusion equation, parabolic). 
The three types require different boundary conditions (more on this later) 
and different numerical methods. 


The coefficients and hence the type depend on æ, so that the type may change 
in §2. A classic example is the Tricomi equation: 
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—¥,11 T T222 = 0. 


This equation is elliptic for x2 > 0 and hyperbolic for zz < 0; for z = 0 
the type is not defined. This situation arises in transonic flow, where we have 
ellipticity where the flow is subsonic and hyperbolicity where it is supersonic. 


Example 2.2.1. Equation of undefined type 
Consider 


What is the type of this equation? It is left to the reader to verify, that the 
matrix to consider is 


010 
A=|{100 
000 
The eigenvalues are 0, +1, so that the above equation is unclassified. o 


Physical significance of classification 


The classification corresponds to different types of qualitative behavior of 
solutions, and hence to differences in the underlying physics. This can be 
shown as follows. A plane wave solution is given by 


p=, waæ)=n-z—b, 


where the function w(æ) describes the wave fronts or characteristic surfaces, 
given by w(æ) = constant. Substitution in (2.2) gives (in order to bring out 
the physical differences between the three cases it suffices to restrict ourselves 
to ba =€ = 8 = Îl, dag = constant) 


GapnaNng = 0 $ (2.10) 


with n = gradw the normal to the wave front. It is convenient to rewrite 
(2.10) as 


n?An=0. (2.11) 


Let v'%), a = 1,2,...,d be the set of orthonormal eigenvectors of the matrix 
A, with corresponding eigenvalues Aa. Because A is symmetric, v{*) and Aq 
are real. Let us write 


n = cavl® , 


Substitution in (2.11) gives 
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oc? =0. (2.12) 


If (2.2) is elliptic then A, > 0, which contradicts (2.12), so that there is no 
solution for n; hence wave-like behavior of the solution is not to be expected. 
If àa > 0 then A is positive definite and vice-versa, so that 


Gapeaks >0, Va Æ 0. (2.13) 
This is often taken as the definition of ellipticity of (2.2). 


In the hyperbolic case one eigenvalue, say 41, is negative, and solutions are 
obtained with 


oF = (So Aac?) /À (2.14) 


a>l 


so that wave-like solutions exist. Furthermore, n can point in any direction, 
since there are d — 1 free parameters in (2.14); hence, wave propagation may 
take place in any direction. In the parabolic case, let A, be the zero eigen- 
value. The only solutions of (2.12) are c1 £ 0, ca = 0, a > 1, so that wave 
propagation can take place in one specific direction only, corresponding to a 
time-like variable. 


Another way to illuminate the physical significance of the classification is to 
look at the role of the boundary conditions. As we will see, in the elliptic case 
boundary conditions have to be prescribed all along the boundary of the do- 
main, and a local change in boundary data influences the solution everywhere. 
One might therefore say that elliptic equations model equilibrium phenom- 
ena. In the hyperbolic case a local change in boundary data propagates its 
influence only in part of the domain, showing a wave-like behavior, so that 
hyperbolic equations model propagation phenomena. In the parabolic case, 
as we just saw, invariably a time-like independent variable can be identified. 
Changes in boundary data at a certain time make themselves felt everywhere, 
but only at later times. The time-like variable cannot be reversed without 
jeopardizing the well-posedness of the problem, as will be seen. Note that 
equations (2.7) and (2.8) are invariant under the transformation £a := —Za, 
but that (2.9) is not invariant under the transformation zı := —2z,. This irre- 
versibility is why the variable concerned is called time-like. In short, parabolic 
equations model diffusion processes. 


First order systems 


Extension of the classification to equations more general than (2.2) can be 
done on the basis of the criterion whether or not wave-like solutions exist. It 
is always possible to write a system of partial differential equations as a first 
order system of m equations with m unknowns: 
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Fol, + FoU a =Q, (2.15) 
with U = (U1, ..., Um) and where Fo(t,æ, U), Fa(t,2,U) are m x m matrices. 


We consider the case where Fo and Fa are made constant by ‘freezing’ them 
to their local value at t = tp, æ = æp,U = Up. We now ask whether plane 
waves can be solutions of the homogeneous version of (2.15). Plane waves are 
solutions of the following type: 


a 


U = Úe te) Ü = constant, w(t,2)=n-2—At. 


This wave travels in the direction n with velocity à/|n|. Substitution shows 
that plane waves are solutions if 


a 


(naFa —AFo)U = 0. (2.16) 


Here n € R’ is real. This is called a generalized eigenvalue problem. Eigen- 
solutions U may exist if \ and n satisfy some constraint: 


A= A(n). 


The classification of (2.15) in {tp, £p, Up} depends on the number of plane 
waves that exist. The classification is as follows. 


Definition 2.2.2. The system (2.15) is called hyperbolic in (tp, ®p, Up) if 
all eigenvalues à = A(n) of (2.16) are real and if there exist m linearly 
independent eigenvectors. 


If, on the other hand, no real eigenvalues exist at all, then wave-like solutions 
are not possible, and the system is called elliptic: 


Definition 2.2.3. The system (2.15) is called elliptic in (tp, &p, Up) if the 
eigenvalue problem (2.16) has no real eigenvalues à = A(n). 


A generalization of parabolicity to the case of first order systems is: 


Definition 2.2.4. The system (2.15) is called parabolic in (tp, &p, Up) if all 
eigenvalues à = A(n) of (2.16) are real and if there exist only m — 1 linearly 
independent eigenvectors. 


These definitions can be applied to the stationary case by deleting A and Fo, 
and by checking whether n is real or not, and by counting the eigenvectors. 


If the system (2.15) does not fall into one of these three categories it is 
called hybrid. Because higher-order systems can be reformulated as first order 
systems, this classification scheme can be generally applied. 


Example 2.2.2. Second order equation 
Consider the following special case of (2.2): 


—Aap fF, ap = 0. (2.17) 
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This is equivalent to the following first order system: 
Pra = Ux, —üapUa,p = 0 . (2.18) 


In order to obtain a system of type (2.15) we eliminate p by differentiation 
and combination: 


Pad— Pda =l, a =1,2,. d- 1, 
and we obtain the following system of d equations for d unknowns: 


Uad — Uda = 0, a = 1, 2, wid—1, 
= 0. 


on Ue (2.19) 


The reader may verify that in the case of Laplace’s equation and d = 2 
equations (2.19) are the Cauchy- Riemann equations: 


U12 -U21 =0, Uaa =0- 


These equations are equivalent to Laplace’s equation, but for d > 2 equations 
(2.19) and (2.17) are not equivalent. Solutions of (2.17) are solutions of (2.19), 
but not vice-versa. For example, for d = 3 a solution of (2.19) is obtained with 
Uz = 0, Ua = Ua(z1, £2), a = 1,2 and —aapUa, g = 0. However, in general 
this will not satisfy U1,2 = U2,1, so that we cannot solve for p from (2.18). 
In other words, the transformation (2.18) introduces spurious solutions. 


By comparing (2.19) and (2.15) and specializing to d = 3 we find Fo = 0 and 


0 0 1 0 0 0 
Fi = — 0 0 0 , P=- 0 0 1 ; 
@11 421 431 412 422 432 
—-1 0 
F3 = — 0 -1 0 


413 423 G33 


For (2.16) to have a non-trivial solution of the type e (®), w(x) = n a we 
must have det(S> ng F.) = 0, or 


n3 0 —ny 
0= 0 n3 nə = AN3 NaN Glaf - (2.20) 
—Naâlæa Nala TRal3a 


One real solution is obtained for n3 = 0, giving w = w(z1, x2). From (2.16) it 


a 2 2 a 
follows that U3 = 0, 3) SO w.gagaUg = 0, which has non-zero real solutions 
a=1f=1 


for every w(x1, £2), giving solutions of (2.17) of the form U, = Jat, a= 
1,2, U3 = 0. However, in general these will not satisfy U1,2 — U2,1 = 0, or 
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wÜ 1— wi, = 0. Hence, these are spurious solutions. The other possibility 
following from (2.20) is 


NaNsadas = 0 5 


which is just (2.11), bringing us back to the classification of second order 
systems. We see, that apart from spurious solutions, the classification of first 
order systems agrees with the classification of second order systems. g 


Example 2.2.3. Parabolic equation 
Consider equation (2.9). It can be written as a first order system by defining 


=p, Vz=9,2. 


We elemine ¢ by differentiation, as in Example 2.2.2, and obtain the following 
first order system: 


Uy. =U2, 
Uy1 —U22=0. 


This gives us the following first order system: 


00 1 0 U. 
FiUiı + FoU2 =Q, A=(19): A=(9 1): o= (F) 


The eigenproblem becomes 


ng 0 
ni —Tly 








giving n = (nı, 0) with nı arbitrary. The eigenvector U has to satisfy U, = 0, 
so that there is only one eigenvector: 


~ 0 
o=(%,) 


Hence, the system is parabolic according to Definition 2.2.4, and we find this 
classification to be in agreement with Definition 2.2.1. o 


Exercise 2.2.1. Derive the form taken by the wave equation (2.8) after the 
following change of coordinates: 


Yı = zı + £3, Y2 = £1 — £2, 


and determine the type of the resulting equation. Show that every solution 
of the wave equation can be written as 


p(x) = (zı + z2) + n(z1 — 22). (2.21) 
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Exercise 2.2.2. Show that 





plt,#) = == J po(€) exp(—(a — €)?/4t) dé 


satisfies the diffusion equation (2.9). More difficult (see Hackbusch (1986)): 
show that 


lim p(t) = o(2) . 
Exercise 2.2.3. Prove (2.4)-(2.6). 
Exercise 2.2.4. Determine, depending on the parameter c, the type of 
—p 11 + 2ey 12-22 =). 
Exercise 2.2.5. Let a coordinate transformation æ — y be given by 
z = By, 


with B a constant non-singular d x d matrix. Show that this transformation 
leaves the type of equation (2.2) invariant. Hint: use Sylvester’s lemma: If 
C = BTAB and A = AT, then the number of positive, zero and negative 
eigenvalues of A and C is the same. 


Exercise 2.2.6. Repeat Exercise 2.2.5 for equation (2.1). Next, consider the 
transformation 


with B a constant non-singular (d + 1) x (d+ 1) matrix. Show that the type 
of equation (2.1) is invariant. 


2.3 Boundary conditions 
We start with the stationary case of (2.1): 


Lg = — (aap P p), a + bap a +ey =s, (2.22) 


and assume this equation to be elliptic for all æ € 92. 
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Well-posed problems 


To ensure the existence of a unique solution, suitable boundary conditions 
have to be imposed at the boundary 02 of 92. In order to be amenable to 
computation in the presence of rounding and truncation errors, and in general 
also in order to correctly model physical phenomena, the solution p should 
depend continuously on the data, i.e. on the right-hand-side s and on the 
prescribed boundary values. In other words, small perturbations in the data 
should not cause large changes in the solution. This leads us to the concept 
of well-posed problems. Let the boundary condition be given by 


Boe=f on ôN (2.23) 


with B some operator, typical examples of which are given in (2.26)-(2.28), 
or B could be the zero operator on part of 092, in which case no boundary 
condition is prescribed there. Let 3 y, S 35 s and F 5 f be suitable function 
spaces. Then the concept of well-posedness can be defined as follows: 


Definition 2.3.1. Well-posedness 
The problem specified by (2.22) and (2.23) is called well-posed if for all 
f € F,s€S the following two conditions are satisfied: 


(t) There exists a unique solution p € ®; 
(ii) For every two sets of data fy, s1 and f2,sq in F and S the corresponding 
solutions pı and p3 satisfy 


l1 — valle < Cif- fall + ||s1 — sails} (2.24) 
with C some fixed constant. 
For mathematical precision, the function spaces $, F and S have to be speci- 


fied (their choice depends on the problem, notably on the smoothness of 0{2), 
but the general idea is clear. 


The following three types of boundary conditions lead to well-posed elliptic 
boundary value problems, assuming 


c>0 (2.25) 


(we will not prove this; there is a huge amount of literature on this subject): 


y=f on ON (Dirichlet) , (2.26) 
Notes ?,e =f on ON (Neumann) , (2.27) 
Natap?,s tap=f, a>0 on N (Robin), (2.28) 


with n the outward unit normal on ôN. Instead of (2.27) often the condition 
NaY,a = f is given as Neumann condition, but physics always leads to (2.27); 
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if dag = 0 for 8 # a the two versions are equivalent. 


When 2 is simply connected and 092 and the data f, s satisfy suitable smooth- 
ness conditions, then (2.26) and (2.28) are known to result in a well-posed 
problem. This is also the case if 02 is divided in segments, in each of which 
we have precisely one of the conditions (2.26)-(2.28), with (2.26) or (2.28) in 
at least one of the seqments. 


Compatibility condition 


In the case of a pure Neumann boundary value problem the situation is more 
complicated if c = 0 in (2.22). Obviously, if a solution exists an arbitrary 
constant can be added to it. Usually b = 0 in this situation. Assuming this, 
integration of (2.22) over 2 and application of the divergence theorem gives 


fdS =- | sdQ. (2.29) 
J ns=-] 


on n 


If and only if this so-called compatibility condition is satisfied, solutions exist. 
They can be stably computed, and the problem is still considered to be well- 
posed, although we have no uniqueness in the strict sense. Equation (2.29) 
expresses a physical conservation principle: transport through the boundary 
balances production in the interior. 


We illustrate these theoretical considerations with some examples. 


Example 2.3.1. Hadamard’s problem 
Let N = (0,1) x (0,1), and consider the following problem: 


—Y aa = 0 in n 5 
with boundary conditions 


(xı, 0) = 0, ~p,2(%1,0) = f(z1), (2.30) 
y(0,22) = p(l, x2) = 0. (2.31) 
Notice that these conditions are not of the type discussed before, since we 


have two conditions at the seqment zz = 0 of 092, and none at z2 = 1. With 
f(zi)=- 4 sinh mx ,, separation of variables gives the following solution: 


1 . 
p(æ) = —; f (#1) sinh mrga. 
nm 
With the maximum principle we can establish uniqueness. Nevertheless the 


problem is ill-posed, which can be seen as follows. If f(x) = 0 the solution 
is p(x) = pı({æ) = 0. With f as above, we have p(z1,1) = p2(z1,1) = 
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—1, f (#1) sinh mm. Given an € there is an M such that |f(21)| < € for all 


m > M. However, due to the exponential growth of sinh we can for arbitrarily 
large A always find an m > M such that |pı(z1, 1) — ¢2(r1, 1)| > K, so that 
(2.24) does not hold. 0 





Example 2.3.2. Two-dimensional potential flow 
Let 2 = {æ € R*: taza > 1}, ie. the exterior of the unit circle. Consider 
the following problem: 


-Paa 70 in N, 

Napa =0 at Lata =1, 
lim y/r1=U. 

|| + 00 

This describes potential flow around a cylinder with free-stream velocity U 

in the z-direction at infinity. At first sight it seems that a difficulty is the 

prescription of an infinite value for y at infinity, but this difficulty is easily 

surmounted by a change of the dependent variable: y = y — Uri. Using 

a conformal mapping method, well-known in classical fluid dynamics, the 

following one-parameter family of solutions is found: 


yp = U(r + 1/r)cosé + 0/27 , 


with (r,@) polar coordinates, and y an arbitrary constant. The physical mean- 
ing of y is that it governs the amount of circulation, and in fluid dynamics 
it is determined by an additional condition, called the Kutta condition. The 
reason that uniqueness is lacking here is that $2 is not simply connected. O 


Example 2.3.3. Helmholtz equation 
Let 2 = (0,7) x (0,7), and consider 


-P aa — àX’ p = 0 in 2, y=O0 on ON. 


Here (2.25) is violated. A solution is obviously p = 0. For A = 1,2,3,... 
(eigenvalues) we also have non-zero solutions (eigenfunctions) given by 


p= sin AL sin AX 5 


so that there is no uniqueness. Although this problem does not satisfy our 
definition of well-posedness, it is meaningful and can be stably computed as 
an eigenvalue problem. g 


The parabolic case 


Let equation (2.1) be parabolic. Then a well-posed problem is obtained if an 
initial condition is given: 


y(0,#)=g(#), rEN 
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and, furthermore, boundary conditions on ôN that would fit an elliptic prob- 
lem, as given by (2.26)—(2.28), with in this case f = f(t, x). If singularities 
are to be avoided, initial and boundary conditions should agree at t = 0 and 
x € ðN. Note that at t = T no condition is to be given. 


Consider the following pure initial value problem for the diffusion equation 


(2.9): 
Y-~u=0, ER, p(0,z1)= f(21), O<t<T. (2.32) 


The solution is given by: 
1 oa 
t, = = Xpi— — y}? At\d ; 
elt ei) = 5 =i f twe p{—(x1 — y)"/4t}dy 


which is easily verified. Clearly, for t < —1 the solution is very sensitive to 
perturbations in f, so that the problem is not well-posed. This means that 
the condition ¢ > 0 in (2.32) is essential: time cannot be reversed. This corre- 
sponds to the intuitive notion that from a smooth temperature distribution 
the corresponding, perhaps non-smooth, temperature distribution at a suf- 
ficiently removed earlier instant of time cannot be stably determined. Note 
that for all f, 


00 


plo, 21) = constant = J Flyjdy . 


This irreversibility of time is a hallmark of parabolic problems. 


Example 2.3.4. Backward solution of the diffusion equation 
Consider the diffusion equation 


yr-%i=0, O<t<T, 2, € (0,1), 


with boundary conditions 


y(t, 0) = y(t, 1) = 0 


and initial (or rather ‘final’) condition 
l, 
(T, z1) = — sin mrzı. 
m 
By separation of variables the following solution is obtained: 
_ i 227 _ 
p = m exp(m n (T — t)) sin mrga. 


so that 
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1 
y(0, x1) = — exp(m? r? T) sin mrz; , 
m 


which can be made to differ from zero by an arbitrary amount by choosing 
m large enough. By an argument similar to that employed in Example 2.3.1, 
we see that the problem is ill-posed. g 


The hyperbolic case will not be discussed here, but in Chap. 8. For the wave 
equation (2.8) one can determine which boundary conditions lead to a well- 
posed problem by requiring that the two functions ọ and 7 in the general 
representation (2.21) are determined uniquely. Illustrations are given in the 
following exercises. 


Exercise 2.3.1. Using (2.21), show that boundary conditions (2.30), (2.31) 
lead to a well-posed problem for the wave equation (2.8). 


Exercise 2.3.2. Let £2 = (0,1) x (0,1), and consider the wave equation (2.8) 
with a Dirichlet condition prescribed at all of 02. Using (2.21), show that in 
general a solution does not exist. Note that according to the theoretical results 
presented earlier, this boundary condition leads to a well-posed problem for 
the Laplace equation. 


2.4 Maximum principles 
Physical interpretation 


In this and the following section we discuss qualitative properties of the 
solution of (2.1), giving a-priori information that can be used advantageously 
in the development of numerical approximations. It is assumed that (2.13) 
holds, so that (2.1) is parabolic. An intuitive idea about the behavior of 
solutions of (2.1) is obtained by associating with this equation a physical 
interpretation. For example, (2.1) models the temperature distribution in a 
fluid with temperature p, heat source distribution q, velocity field u and heat 
conduction tensor aag. If q < 0, no heating takes place and it is intuitively 
clear that if p has a local maximum pm at (tm,@m), then at t < tm a value 
P > Pm is to be found somewhere. Furthermore, large values of y may be 
imposed on 092 by a Dirichlet boundary condition. Hence, we arrive at the 
following hypothesis: local maxima can occur only for t = 0 and/or æ € ôN. 
Such a marimum principle can be very useful. We now give it a mathematical 
basis. More background may be found in Protter and Weinberger (1967) and 
Sperb (1981). 
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The one-dimensional stationary case 


In this case equation (2.1) may be written as, taking c = 0, writing D instead 
of aj; and writing u instead of b, 


u— ——(D—)=s, x € (a,b). 
Suppose » has a local maximum in x = £m € (a,b). Then dy(z,,)/dx = 0, 
and d?p(zm)/dx? < 0. Assuming D to be differentiable, we find 
2 
dip pf y dDdy 


s = u— — D—~ - —-— 


dz dz? dz dx’ 


so that s(x) > 0. It follows that if s < 0, then y cannot have a local maxi- 
mum in (a, b). This result is strengthened in the following theorem. 


Theorem 2.4.1. One-dimensional marimum principle 
Let p satisfy 


u— ~ —(D—)<0, x€(0,1), (2.33) 


with D > 0 differentiable and bounded and p not constant. Then p has a 
local mazimum only in zg = 0 and/or z = 1, and dg/dn > 0 in a boundary 
point with a local marimum, with dp/dn the outward derivative in x = 0 or 
z=1. 


Proof. (cf. Sperb (1981)). We have 
_ dp dọ 
Le =v T 73 SO r € (0,1), 


with v = u/D — dìn D/dz. Assume y(c) = M, 0 < ¢ < 1, isa local 
maximum. Let (a,b) C (0,1) be a neighbourhood of c in which p(z) < M. 
Because ¢ is not constant it is possible to choose a and b such that g(a) < M 
and/or (b) < M. Assume y(b) < M; if not, the following argument is easily 
repeated using y(a) < M. Then there is a point z = d, c < d < b with 
y(d) < M. Let 


pe) = et") 1, 
with a > 0 a constant still to be determined. We have 
Ly = (av — a? )e%?-9) 
Since v is bounded we can choose a such that Ly < 0 in (a,d). Let 


(2) = oz) +ev(z) , 
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with £ a constant such that 


e€ < (M — (d))/¥(d). 


It is easily seen that 6(x) < M in (a,c), O(c) = M, @(d) < M. This means 
that 6(x) has a maximum M > M in (a,d). Furthermore, 


L9 = Lp +eLy<0 in (a,d). 


But then @ cannot have a maximum in (a, d), according to the result which 
preceeds the theorem, and we have a contradiction. Hence, y cannot have a 
local maximum in (0, 1). It remains to prove the property of dp/dn. Suppose 
that y(0) = M and p(x} < M, z E€ (0,1), with y(d) < M for some d € (a,b). 
Now define 


y(r) =e —1, 


and choose œ > 0 such that Ly < 0 in (0, d). Defining 6(2) and £ as before 
we have Lô < 0 in (0,d) and the maximum of 8 in [0, d] must therefore occur 
at x = 0 or x = d. But 6(0) = y(0) = M > 6(d) because of our choice of £. 
Therefore we have 


46(0) _ dp(0) , dv(0) _ dy(0) 











de dx« “dr dz tea <o, 
which implies 
dp(0) _ _dẹ(0) 

dù — dz >0 
The procedure is similar if y(1) = M. o 
In a similar way we can prove that if ọ satisfies 

dp d , dọ 
“Te zP) =" z € (0,1), 


and if the other conditions are satisfied, then y has a local minimum only in 
z = 0 and/or z = 1, and dy/dn < 0 in boundary points with a local min- 
imum. Hence, in the frequently occurring case that the equality sign holds 
in (2.33), there can be no local extrema in (0, 1). This means that inte- 
rior extrema occurring in numerical solutions, often called numerical wiggles, 
are numerical artifacts. Furthermore, if we have a homogeneous Neumann 
boundary condition 


dyp(0) 
dn 





= 0 


then there can be no local extremum at x = 0, and similarly in the case of 
such a boundary condition at z = 1. 
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The general stationary case 


We rewrite (2.2) with e = 0 as 
UaY,a — GapY,ep =8, BENC Ri, (2.34) 


with in every point of @{2 precisely one of the boundary conditions (2.26)- 
(2.28). By saying that 092 is smooth we mean that for every æ € OS? there 
exists an open sphere in {2 that is tangent to 092 in æ. The following theorem 
holds: 


Theorem 2.4.2. (Mazimum principle) 

Let (2.84) hold with s < 0, let any outward derivative ĝðp/ðv < 0 on ag c 
82, and let AQ be smooth. Then local mazima occur only on 82 \ A, 

p = constant in £. 


Proof. See Protter and Weinberger (1967) Sect. 2.3 Theorems 5 and7. O 


By reversing sign we obtain a similar minimum principle. Note that Nadas P,e 
(cf. (2.27)) is an example of an outward derivative. In the frequently occurring 
case that s = 0 anda homogeneous Neumann condition 0¢/0n = 0 on ðN, 
both maxima and minima, i.e. extrema, can only occur on 8N \ 02. Should 
extrema occur in §?2 in a numerical solution, then we know that such extrema 
are (undesirable) numerical wiggles. Discretizations that satisfy a maximum 
principle similar to Theorem 2.4.2, and hence exclude numerical wiggles, are 
called monotone schemes. Monotone schemes will be developed later. The 
maximum principle can be used in the derivation of global error estimates, 
that we also will present later. 


The general nonstationary case 
Consider the following version of (2.1): 


Pt t+ Uefa —GopPag=a, DEN, 0<t<T. (2.35) 


We have 


Theorem 2.4.3. (Mazimum principle) 

Let s < 0 in (2.95), let any outward derivative ĝðp/ðv < 0 on ƏN C AQ, 
and let AQ be smooth. Then if a local mazimum occurs at t = t* > 0 in the 
interior of 2 or on OQ, then p = constant, 0< t< t*. 


Proof. See Protter and Weinberger (1967) Sect. 3.3, Theorems 5 and 6. O 
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Hence if p £ constant, local maxima can occur only at t = 0 and portions of 
the boundary where 0y/dv > 0. 


We have formulated maximum principles for elliptic and parabolic partial 
differential equations. For hyperbolic equations there are maximum principles 
of a different type that will not be discussed; see Protter and Weinberger 
(1967). 


2.5 Boundary layer theory 


Consider the following version of (2.1): 
t+ Uap a — EP aa =S, BEM, O<t<T, e>0. (2.36) 


Boundary layer theory or singular perturbation theory studies what happens 
when € | 0. Equation (2.36) is parabolic, and we prescribe initial and bound- 
ary conditions in accordance with the rules of Sect. 2.3. 


Let the following initial condition be given: 
(0, æ) = po(æ) . 


At every point of 092 precisely one of the three types of boundary condition 
(2.26)—(2.28) is precribed. 


The convection equation 


When £ < 1 it is natural to approximate (2.36) by putting € = 0. For 
simplicity we also take s = 0, and obtain the convection equation: 


Pt + uapa =0. (2.37) 


This equation is hyperbolic. Solutions cannot. satisfy all of the initial and 
boundary conditions, as will be seen shortly. 


Characteristics 


Let us define curves in (t, æ) space by relations t = t(s), æ = a(s), satisfying 


dt dra 
ds = 1, ds = ux. (2.38) 


Then equation (2.37) reduces to 
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dp 
—=0. 2. 
T; 0 (2.39) 


Since the curves defined by (2.38) are particle paths, equation (2.39) expresses 
the fact that y is constant along particle paths. These curves are called 
characteristics in the theory of hyperbolic systems. 


One-dimensional case 


Although the one-dimensional case is very simple, we will dwell a bit upon 
it, in order to elucidate the principles of singular perturbation theory. These 
principles will be applied subsequently in a more general setting. 


Let 92 = (0,1). Equation (2.36) reduces to, taking s = 0, 
Y+uyi-epiu=0, O<a<l, O<i<T. (2.40) 
We assume the following initial and boundary conditions: 


y(0,21) = golai), O<ai<l, (2.41) 
y(t,0) = folt), et l=fAlt), O<t<T. (2.42) 


For £ < 1 one would expect that equation (2.40) can be approximated by 
pı +u =D. (2.43) 


Solutions of (2.43) are constant along the characteristics. Figure 2.1 gives a 
sketch of the characteristics of (2.43) (as defined by (2.38)), assuming u1 < 0. 
Consider the characteristic C1. On Ci, » = y(C1) = constant. In P, we have 





o 


Fig. 2.1. Characteristics for equation (2.43). 


the initial condition (2.41), in Pz the first of the boundary conditions (2.42). 
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Only one of them can be satisfied in general. What value to take for (C1) 
such that the solution of (2.40)—(2.42) is approximated? The difficulty has to 
do with the change of type that (2.36) undergoes when <£ is replaced by zero. 
For £ = 0, (2.36) is hyperbolic; for € > 0, it is parabolic. 


Singular perturbation theory 


The answer to the foregoing question is provided by singular perturbation 
theory. Introductions to this subject are given in Eckhaus (1973), Kevorkian 
and Cole (1981) and Van Dyke (1975). In the present case, p = (C1) = 
(Pı) is a good approximation for € < 1 to the solution of (2.40)—(2.42) in 
1 > zı > 6 = O(e), whereas (2.43) has to be replaced by a so-called boundary 
layer equation to obtain an approximation in ô > z; > 0. This can be seen as 
follows. First, assume that we indeed have y(C1) = (P1) in 1 > zı > 6 with 
ô & 1. In ô > zı > 0 we expect a rapid change of p from y(P,)} to (P2). 
For derivatives of p we expect 


ap 
dx? 





= O(6-") , (2.44) 


so that perhaps the diffusion term in (2.40) cannot be neglected in the bound- 
ary layer; this will depend on the size of 6. Assume 


ô = Ole?) , (2.45) 


with a to be determined. In order to exhibit the dependence of the magnitude 
of derivatives on £ we introduce a stretched coordinate 2: 


Ly = rye | A (2.46) 


which is chosen such that zı = O(1) in the boundary layer. It follows from 
(2.44)-(2.46) that 





= O(1) (2.47) 
in the boundary layer. In the stretched coordinate equation (2.40) becomes: 


— Op 1-20 ap 
Q, e Ė5— —— = . 2.4 
pi +e “uaa —e oF 0 (2.48) 
Letting € | 0 and using (2.47), equation (2.48) takes various forms, depending 
on a. The correct value of a follows from the requirement, that the solution of 
the e | 0 limit of equation (2.48) satisfies the boundary condition at zı = 0, 
and the so-called matching principle. 
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Matching principle 


As #1 increases, the solution of (the e | 0 limit of) equation (2.48) has to 
somehow join up with the solution of (2.43), i.e. approach the value (C1). 
In singular perturbation theory this condition is formulated precisely, and is 
known as the matching principle: 
lim inner (t, #1) = lim outer (¢, 2) . 
T 1—00 zı40 
Here Pinner: also called the inner solution, is the solution of the inner equation 
or boundary layer equation, which is the limit as € | 0 of equation (2.48) for 
the correct value of a, which we are trying to determine. Furthermore, Youter: 
also called the outer solution, is the solution of the outer equation, which is 
the limit as £ | 0 of the original equation, i.e. equation (2.43). In our case 
the matching principle becomes 
lim Pinner (f, 21) = (C1) - (2.49) 
oe it O°) 
As already mentioned, the other condition to be satisfied is the boundary 
condition at x, = 0: 


p(t, 0) = folt) . (2.50) 


For a < 0 (corresponding to compression rather than stretching) the limit as 
E€ | 0 of (2.48) is 


o, = 0. (2.51) 


Obviously, the solution of (2.51) cannot satisfy (2.50), so that the case a < 0 
has to be rejected. With a = 0 equation (2.43) is obtained, which cannot 
satisfy both conditions at t = 0 and x; = z; = 0, as we saw. 


For 0 < a < 1 the limit of (2.48) is 


ul? = 
‘Or 


so that the inner solution is given by 


0, 


p(t, 21) = g(t) } 


and in general equations (2.49) and (2.50) cannot be satisfied, so that we 
cannot have 0 < a < lI. 


For a = 1 equation (2.48) becomes as £ | 0: 


2 
uz(t,0)—— — =z =D, (2.52) 
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where we have used that u(t, z) = ui(t,e£1) —> u(t, 0) as £ | 0. The general 
solution of (2.52) is 


p = A(t) + B(t)e"™ , (2.53) 


with u = ui(t,0). We can satisfy both (2.49) and (2.50), remembering that 
we had assumed u, < 0, with 


A(t) = (Ci), B(t) = folt) — (C1) . 


This gives us the inner solution. In terms of the unstretched variable xı the 
inner solution is given by 


p = (Ci) + {filt) — p(Cl) fet" 


We see a rapid exponential variation from fı(t) to (C1) in a thin layer 
of thickness 6 = O(e), confirming our earlier statement about the behavior 
of the solution. Fig. 2.2 gives a sketch of the inner and outer solutions as a 
function of z1. An asymptotic approximation for £ | 0 that is valid everywhere 
is given by Pinner + Youter — Y(C1) (not shown in the figure). 


F? inner 





Fig. 2.2. Sketch of inner and outer solutions 


The distinguished limit 


The limit as £ | 0 of the stretched equation (2.48) for the special value of 
a = 1 for which the solution of the resulting inner equation can satisfy both 
the boundary condition and the matching principle is called the distinguished 
limit. In order to show that this limit is unique we will also investigate the 
remaining values of a that we did not yet consider, namely a > 1. Now 
equation (2.48) gives the following inner equation: 
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with the general solution 
p= A(t) + B(t)žı . 


The limit of y as žų — co does not exist, so that the matching principle 
cannot be satisfied. Hence, a = 1 is the only value that gives a distinguished 
limit. 


The only element of arbitrariness that remains in this analysis is the assump- 
tion that we have a boundary layer at xı = 0. Why no boundary layer at 
t = 0, and y(C1) = »(P2) (cf. Fig. 2.1)? This can be investigated by assum- 
ing a boundary layer at t = 0, and determining whether a distinguished limit 
exists or not. This is left as an exercise. It turns out there is no boundary 
layer at t = 0. Hence, singular perturbation theory determines uniquely the 
asymptotic behavior of the solution along the characteristic C4. 


It is left to the reader to verify in a similar way that along characteristic C2 
(cf. Fig. 2.1), which does not originate in t = 0 but at the boundary zı = 1, 
we have (C2) = y(P3), except in a boundary layer with thickness O(c) at 
zı = 0. When wu; > 0 there is a boundary layer at x; = 1, and no boundary 
layer at zı = 0. We see that boundary layers will not be present at inflow 
boundaries (i.e. parts of the domain boundary 092 where the flow enters the 
domain, that is where u-n < 0 with n the outward normal on 0/2). This 
holds in any number of space dimensions. 


The role of boundary conditions 


The occurrence of boundary layers is strongly influenced by the type of 
boundary condition. Let (2.42) be replaced by 


“~~ = f(t), plt, 1)= fit), 0<t<T. (2.54) 


As before, a boundary layer of thickness O(c) is found at zı = 0, and 
the boundary layer equation is given by (2.52), with solution (2.53). Tak- 
ing boundary condition (2.54) into account we find 


Bit) = efo(t)/u , 


so that B(t) > 0 as e | 0. Hence, to first order, there is no boundary layer, 
and the outer solution (solution of (2.43)) is uniformly valid in 0 < zı < 1. 


76 2. Partial differential equations: analytic aspects 


Two-dimensional case 


Let {2 = (0,1 x (0,1). If we take s = 0, and consider the time-independent 
case for brevity, equation (2.36) reduces to 


Yala — EP aa =0, æ € (0,1) x (0,1). (2.55) 
Assume Dirichlet boundary conditions: 


y(21,0) = fi(x1), p(l, 22) = fo(x2), y(a1,1) = falar), 
p(0, £2) = fa(xe). (2.56) 


For £ | 0 we obtain the following outer equation: 
tala = 0. (2.57) 
This is a hyperbolic equation, with characteristics defined by 


dz o 
ds 





=U,, a=1,2. (2.58) 


For a sketch of possible characteristics we can reuse Fig. 2.1 by replacing 
t with z2. Along Ci, p = (C1) = constant. Again the question arises, 
whether y(C;) will take on the prescribed value y(Pi) or y(P2). First, let 
us postulate a boundary layer at zı = 0. Reasoning as before, we find the 
following boundary layer equation: 
ð dy ` 
uE — an? =0, #1 = zı/E, u= u (0, z2). (2.59) 
The following solution satisfies both the boundary condition at 2; = 0 and 
the matching principle, provided u < 0: 


p(ž1, £2) = p(P1) + {f4(22) — p(Pr)}e*™ . 


In case we have a Neumann boundary condition at zı = 0 there is no bound- 
ary layer at x; = 0. As in the one-dimensional case, when x, = 0 is an inflow 
boundary {u > 0) there is also no boundary layer at x, = 0. 


Parabolic and ordinary boundary layers 


Next, consider the boundary zz = 0. If it is an outflow boundary, i.e. v = 
u2(#1, 0) < 0, then there is again a boundary layer of thickness O(e) in case 
of a Dirichlet boundary condition. The boundary layer solution is found to 


be 


(21, #2) = p(P2) + {fi(21) ~ p(Pa)}e"*?, #2 = a2/e. (2.60) 
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xy 


0 1 
Fig. 2.3. Characteristics of equation (2.57) in a channel flow 


Now consider the case that z = 0 is a solid wall, so that v = 0. The shape of 
the characteristics of the outer equation (2.57) might be as in Fig. 2.3, where 
also 22 = 1 is assumed to be a solid wall, so that we have a channel flow. 
Since u2(2#1,0) = 0, the curve #2 = 0 is a characteristic of the outer equation 
(2.57) according to (2.58), so that the solution along this characteristic is 
given by 


(21,0) = fa(0) (2.61) 
if z; = 0 is a inflow boundary, or by 
p(x1,0) = f2(0) (2.62) 


if x4 = 1 is an inflow boundary. To verify this using singular perturbation 
theory is left as an exercise. If both zı = 0 and z, = 1 are inflow boundaries 
the pattern of characteristics has to be qualitatively different from Fig. 2.3, 
assuming that Ug, = 0 (incompressible flow field). This situation will not be 
considered here. Whether we have (2.61) or (2.62), in both cases the outer 
solution cannot satisfy boundary condition (2.56) at zz = 0. Hence, we expect 
a boundary layer at z3 = 0. Obviously, this boundary layer will be of different 
type than obtained until now, because the boundary layer solution cannot 
be given by (2.60), since now we have v = 0. In order to derive the boundary 
layer equation, the same procedure is followed as before. We transform (2.55) 
to the stretched coordinate £2 = r2€~%, with a to be determined. Equation 
(2.55) becomes, with uz = 0: 


2 2 
uO? OP _ 1-20 FP _ 9, (2.63) 


The boundary condition is 


y(r1,0) = fi(z1) , (2.64) 
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and the matching principle gives 


lim | (#1, #2) = lim po(21, 22) - (2.65) 


£g 7400 


Now we take the limit of (2.63) as £ | 0. For a < 1/2 the outer equation at 
Zz = 0 is recovered: 


dy 


ua = 
Ox 


of which the solution obviously cannot satisfy (2.64) and (2.65). For a = 1/2 
the limit of (2.63) is 


0, 


ðp Op 
“Oe, aR 8 (2.66) 


with u = u(zı) = lim ui (21, #2) = uı(zı, 0). This is a parabolic partial 
differential equation, which in general cannot be solved explicitly, but for 
which it is known that boundary conditions at Z2 = 0 and ř = œ give a 
well-posed problem. An analytic expression for the solution of the boundary 
layer equation (2.66) with u = 1 will be given in Sect. 4.7. Hence, a = 1/2 
gives the distinguished limit, and (2.66) is the boundary layer equation. The 
thickness of this type of boundary layer is O(,/e), which is much larger than 
for the preceding type. 


In order to specify a unique solution, in addition an ‘initial’ condition has to 
be specified. Assuming u > 0, this has to be done at xı = 0. From (2.56) we 
obtain the following initial condition for the boundary layer solution: 


p(0, 22) = falž2 v£) , 


which to the present asymptotic order of approximation (we will not go into 
higher order boundary layer theory) may be replaced by 


(0, ž2) = fa(0) . 


It is left to the reader to verify that a > 1/2 does not give a distinguished 
limit. 


The cause of the difference between the two boundary layer equations (2.59) 
(an ordinary differential equation) and (2.66) (a partial differential equation) 
is the angle which the characteristics of the outer equation (2.57) make with 
the boundary layer. In the first case this angle is non-zero (cf. Fig. 2.1), in the 
second case the characteristics do not intersect the boundary layer. The first 
type is called an ordinary boundary layer (the boundary layer equation is an 
ordinary differential equation), whereas the second type is called a parabolic 
boundary layer (parabolic boundary layer equation). 
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Summarizing, in the case of the channel flow depicted in Fig. 2.3, fore < 1 
there are parabolic boundary layers of thickness O(./é) at z2 = 0 and z2 = 1, 
and an ordinary boundary layer of thickness © (£) at the outflow boundary, 
unless a Neumann boundary condition is prescribed there. 


On outflow boundary conditions 


It frequently happens that physically no outflow boundary condition is 
known, but that this is required mathematically. If € = O(1) such a physical 
model is incomplete, but for € < 1 an artificial (invented) outflow condition 
may safely be used to complete the mathematical model, because this does 
not affect the solution to any significant extent. Furthermore, an artificial 
condition of Neumann type is to be preferred above one of Dirichlet type. 
This may be seen as follows. 


Consider the following physical situation: an incompressible flow with given 
velocity field u through a channel, the walls of which are kept at a known 
temperature. We want to know the temperature of the fluid, especially at 
the outlet. This leads to the following mathematical model. The governing 
equation is (2.55), with » the temperature. Assume £ < 1, and u > 0. 
We have ¢ prescribed at x, = 0 and at z = 0,1, but at xı = 1 we know 
nothing. Hence, we cannot proceed with solving (2.55), either analytically or 
numerically. Now let us just postulate some temperature profile at z; = 1: 


p(l, x2) = fa(22). 


An ordinary boundary layer will develop at zı = 1, with solution, derived in 
the way discussed earlier, given by 


(1,22) = poll, 22) + {f2(£2) — poll, z2) je” . (2.67) 


This shows that the invented temperature profile fa(x2) influences the solu- 
tion only in the thin (artificially generated) boundary layer at zı = 1. This 
means that the computed temperature outside this boundary layer will be 
correct, regardless what we take for fo(z2). When ¢ = O(1) this is no longer 
true, and more information from physics is required. In physical reality there 
will not be a boundary layer at all at zı = 1, of course. Therefore a more 
satisfactory artificial outflow boundary condition is 


dp(l, z2) 


=0, 
Oxy 


since with this Neurnann boundary condition there will be no boundary layer 
at zı = 1 in the mathematical model. 
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Exercise 2.5.1. Consider equations (2.40)-(2.42) with u, > 0. Show that 
for € < 1 there is a boundary layer of thickness O(¢) at and only at zı = 1. 


Exercise 2.5.2. Consider equations (2.55) and (2.56), with the Dirichlet 
boundary condition at xı = 0 replaced by a homogeneous Neumann boundary 
condition. Show that when u < 0 and ¢ < 1 there is no boundary layer at 
£1 = 0. 


Exercise 2.5.3. Derive equation (2.67). 


3. Finite volume and finite difference 
discretization on nonuniform grids 


3.1 Introduction 


Nonuniform grids are often used to obtain accuracy in regious where the 
solution varies rapidly. We will see that on nonuniform grids, finite volurne 
and finite difference discretization are not equivalent. On arbitrary nonuni- 
form grids the local truncation error is usually larger than on uniform grids, 
or grids on which the mesh size varies smoothly. This has sometimes led 
to confusion. Cell-centered finite volume discretization is sometimes advised 
against, because the local truncation error is larger than for vertex-centered 
finite volumes, and is in fact of the same order even as the term that is ap- 
proximated. Nevertheless, this is a good discretization method that is popular 
in reservoir engineering and porous media flow computation, and in compu- 
tational fluid dynamics in general. The source of the confusion is that the 
relation between the local and global truncation error is complicated. Sur- 
prisingly, the global truncation error is small, as we will see. Of course, it is 
the global truncation error that counts. 


In order to make the matter clear we will present a detailed study of global 
truncation errors for simple cases. This will shed light not only on the accu- 
racy of discretizations on nonuniform grids, but also on the accuracy required 
in the approximation of boundary conditions. The fact that often the local 
truncation error is larger at the boundary than in the interior has been an- 
other source of confusion in the past. We will see that even if the local trun- 
cation error is of low order at the boundary and in the interior, the global 
truncation error is still second order for cell-centered finite volume discretiza- 
tion. 


Furthermore, we will study the discretization of the diffusion equation with 
discontinuous diffusion coefficient, because there seern to be no texts giving 
a comprehensive account of discretization methods in this situation, which is 
common in reservoir engineering and porous media flow. 


Elementary introductions to finite difference and finite volume discretization 
are given in Forsythe and Wasow (1960), Mitchell and Griffiths (1994), Mor- 
ton and Mayers (1994) and Strikwerda (1989). 
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3.2 An elliptic equation 


This chapter is devoted to the discretization of (special cases of) the general 
single second-order elliptic equation, that can be written as 


Le = —(4ap¥,a),6 + (baf) a +p =q in QCR*. (3.1) 


The diffusion tensor dg, is assumed to be symmetric: dag = 4gq. Boundary 
conditions will be discussed later. We assume: 


Definition 3.2.1. Uniform ellipticity. 
Equation (3.1) is called uniformly elliptic if there exists a constant C > 0 
such that 


Aap(@)vatg > Crate, YVE R, ees. (3.2) 


For d = 2 this is equivalent to equation (2.4). Property (3.2) is invariant 
under coordinate transformations. 


The domain £2? 


The domain 2? is taken to be the d-dimensional unit cube. This is not a serious 
limitation, because the current main trend in grid generation consists of the 
decomposition of the physical domain in subdomains, each of which is mapped 
onto a cubic computational domain. In general, such mappings change the 
coefficients in (3.1). As a result, special properties, such as separability or the 
coefficients being constant, may be lost. This is the price to pay for geometric 
generality. 


The weak formulation 


Assume that dag is discontinuous along some manifold T' C §2, which we will 
call an interface; then equation (3.1) is called an interface problem. Equation 
(3.1) now has to be interpreted in the weak sense, as follows. From (3.1) it 
follows that 


(Ly,v)=(,¥), YYEH, wu) = | evar, 


N 


where H is a suitable Sobolev space, that we do not need to specify here. 
Define 


a(y, h) 
b(p, p) 


f aapt at pd — f dasy,angydl , 
KP] an 

F(a?) avd , 

n 


III 
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with ng the zg component of the outward unit normal on the boundary 02 
of §2. Application of the divergence theorem gives 


(Ly, y) = aly, p) + (yp, h) + (cy, v). (3.3) 


The weak formulation of (3.1) is 


Find p € Ë such that a(p, Y) + (y, Y) + (cv, Y) = (av), VHEH, 
(3.4) 


with H another suitable Sobolev space. For suitable choices of H ,H and 
boundary conditions, existence and uniqueness of the solution of (3.4) has 
been established. For more details on the weak formulation (not needed here), 
see for example Ciarlet (1978) and Hackbusch (1986). 


The jump condition 


Consider the case with one interface I, which divides 922 in two parts §2) 
and 22, in each of which aag is continuous. At I’, agg(#) is discontinuous. 
Let superscripts 1 and 2 denote quantities on F at the side of 21 and (22, 
respectively. Application of the divergence theorem to (3.3) gives 


alpy) =- | (aappa)ebde+ [(arp¢le—adyea)nbdl. (3.5) 
ANP r 


Hence, the solution of (3.4)satisfies (3.1) in 2/1, together with the following 
jump condition on the interface T: 


anap ang = aap ang on I. (3.6) 


This means that where aag is discontinuous, so is ~,,- This has to be taken 
into account in constructing discrete approximations. 


The pressure equation in reservoir engineering 


With b = c = 0 and a scalar diffusion coefficient (i.e. dag = 0, a # 8 and 
a11 = @22 = 433 = a) equation (3.1) reduces to 


—(AY a) ,0 =q. 


This equation plays an important role in the theory of flow in porous media, 
and closely resembles the pressure equation in the IMPES (implicit pressure, 
explicit saturation) model in reservoir engineering. Usually the domain con- 
tains interfaces across which a(x) has large jumps. For an introduction to 
the IMPES model, see Aziz and Settari (1979). 
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3.3 A one-dimensional example 


The basic ideas of finite difference and finite volume discretization taking 
discontinuities in aag into account will be explained for the following example: 


-(agi)i=9¢, 2€2= (0,1). (3.7) 


Boundary conditions will be given later. 


Finite difference discretization 


A computational grid G C 22 is defined by 
G={eeER: xv=aj;=jh, j=0,1,..,7, h=1/n}. 
Forward and backward difference operators are defined by 
Ay; = (pis — ei)/h, Vei = (pi = 5-1)/F - 
A finite difference approximation of (3.7) is obtained by replacing d/dz, by 
Aor V. A nice symmetric formula is 


-5{V(0A) + A(aV)}—j =93. J=lenn—l, (3.8) 


where q; = q(z;) and pj is the numerical approximation of y(x;). Written 
out in full, equation (3.8) gives 


{—(a;-1 + 4) pj-1 + (aj—1 + 2a; + aj 41); — (aj + aj 41); 41}/2h" = qj ; 
j=l,..,n-1. 
(3.9) 


If the boundary condition at z = 0 is y(0) = f (Dirichlet), we eliminate 
po from (3.9) with go = f. If the boundary condition is a(0)p, (0) = 
f (Neumann), we write down (3.9) for j = 0 and replace the quan- 
tity —(a-1 + ao)p-1 + (a-1 + ao)po by 2hf. If the boundary condition is 
c19,1(0) + c2y(0) = f (Robin), we again write down (3.9) for j = 0, and 
replace the quantity just mentioned by 2h(f — c29)a(0)/e1. The boundary 
condition at x = 1 is handled in a similar way. 


An interface problem 


In order to show that (3.8) can be inaccurate for interface problems, we 
consider the following example: q = 0, and 


a(jz)=e, 0<z<zr*, afz)=1, z*<r<l. 
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The boundary conditions are: (0) = 0, (1) = 1. The jump condition (3.6) 
becomes 


e lim Pits lim Pi: (3.10) 


rtr* 


By postulating a piecewise linear solution, the solution of (3.7) and (3.10) is 
found to be 


p = az, 0<zr<zr*, po=ear+l—ea, *<r<1, 
Z (3.11) 
a = 1/(x* —er* +e). 
Assume 2, < 2* < £k+41. By postulating a piecewise linear solution 
p;=aj, 0<7<k, gy =Bj-Bn+1, k+1<j<n, (3.12) 


one finds that the solution of (3.9), with the boundary conditions given above, 
is given by (3.12) with 





Hence 
Tk 
= —— t, 3.13 
Pk eh(1 —e€)/(1 +£) +(1-—E)tk +E (3-13) 
Let x* = 2441. The exact solution in 2, is according to (3.11) 
Tk 
I.) = m, 
(te) (l—e)apai te 
Hence, the error satisfies 


pr = plz) = 0 ( =) 


E 
l+e 





As another example, let z* = z, + h/2. The numerical solution in zx is still 
given by (3.13). The exact solution in x, is according to (3.11) 


Lk 
Plex) = (1 — E)zk +E +h(1—e)/2- 


The error in x, satisfies 


pr — plek) =O (EL) . 


e(1 +e) 





When a(z) is continuous (e€ = 1) the error is zero. For general continuous 
a(x) the error is O(h?). When a(z) is discontinuous, the error of the scheme 
(3.8) increases to O(h). 
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Finite volume discretization 


By starting from the weak formulation (3.4) and using finite volume dis- 
cretization one may obtain O(h?) accuracy for discontinuous a(z). The do- 
main £2 is covered by cells or finite volumes {2;: 


NR; = (xj; —h/2, xz; +h/2), jul,.,n—-—1. 
Let p(z) be the characteristic function of §2;: 
v(iz)=0, rg Ri; v(x) =1, zE}. 


A convenient unified treatment of both cases: a(x) continuous and a(z) dis- 
continuous, is as follows. We approximate a(z) by a piecewise constant func- 
tion that has a constant value a; in each §2;. Of course, this works best if 
discontinuities of a(z) lie at boundaries of finite volumes {2;. One may take 
a; = a(z;), or 


a; = h=! f aan. 
2; 


With this approximation of a(x) one obtains from (3.5) 


- [loes)sde 
2; 
a 


palin f 1<j<gn-t. (3.14) 


a(y, y) 


By taking successively j = 1, 2,...,n~—1, equation (3.4) leads to n—1 equations 
for the n — 1 unknowns p; (fo = 0 and pn = 1 are given), after making 
further approximations in (3.14). 


In order to approximate ay1(z; + h/2) we proceed as follows. Since au, is 
smooth, p(x; + h/2) does not exist if a(x) jumps at x = x; +h/2, because 
of the jump condition. Hence, it is a bad idea to discretize p (x; + h/2). 
Instead, we write 


Ta41 TIi+ı 1 aria 
pit = J p dr = f ae de S (ap 1)j+1/2 J Pokat (3.15) 


where we have exploited the smoothness of ay. The piecewise constant 
approximation of a gives 


Das 
-dr=h , 
f a z [Wi , 


Tj 
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with w; the harmonic average of a; and a;41: 
wj = 2ajaj+1/(aj + 4541) - 
We obtain the following approximation: 
(49,1) j41/2 = Wi (Yj41 — Pi) /h. (3.16) 


With equations (3.14) and (3.16), the weak formulation (3.4) leads to the 
following discretization: 


wy—1(~j — %j-1)/h — wy (pii — pi)/hk = hgj, J =1,2,..,n-1, (3.17) 


with 


When a(z) is smooth, w; œ (a; +a;+1)/2, and we recover the finite difference 
approximation (3.9). 


Equation (3.17) can be solved in a similar way as (3.9) for the interface 
problem under consideration. Assume 2* = ry + h/2. Hence 


we=e, le jck; we=2e/(l +e); wz=1, k<j<n-l. 
By postuldting a solution as in (3.12) one finds 
B=ac, a= h/(x* —ex*+e). 


Comparison with (3.11) shows that p; = y(z;): the numerical error is zero. In 
more general circumstances the error will be O(h”). Hence, finite volume dis- 
cretization is more accurate than finite difference discretization for interface 
problems. 


Discontinuity inside a finite volume 


What happens when a(z) is discontinuous inside a finite volume 92], at z* = 
zj, say? One has, with y as before, according to (3.5): 


+h/2 , 4: 
aly, y) = -apoti + Jim apı— lim apa. 


The exact solution u satisfies the jump condition (3.6); thus the last two 
terms cancel. Appoximating p,ı by finite differences one obtains 


aly, p) = —aj41/2(Pj+1 — pi)/h + aj-1/2(Pj — Pj-1)/h . 
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This leads to the following discretization: 


[-a;-1/23-1 + (aj-1/2 + Oj41/2) 5 — aj+1/22;+1]/h = hay. (3.18) 


For smooth a(x) this is very close to the finite difference discretization (3.9), 
but for discontinuous a(x) there is an appreciable difference: (3.18) remains 
accurate to O(h?), like (3.17); the proof of this left as an exercise. 


We conclude that for interface problems finite volume discretization is more 
suitable than finite difference discretization. 


Exercise 3.3.1. The discrete maximum and lz norms are defined by, respec- 
tively, 


Ilo = max{|p;|:0<F<n}, lelo =h Aye. (3.19) 
j=0 
Estimate the error in the numerical solution given by (3.12) in these norms. 


Exercise 3.3.2. Show that the solution of (3.18) is exact for the model prob- 
lem treated in this section. 


3.4 Vertex-centered discretization 
Vertex-centered grid 


We now turn to the discretization of (3.1) in more dimensions. It suffices to 
study the two-dimensional case. The computational grid G is defined by 


G={wen: æ= jh, j = (jı, j2) , Ja = 0, 1, na, h = (hi, ha), 
ha = 1/na}. 


G is the union of a set of cells, the vertices of which are the grid points æ € G. 
This is called a verter-centered grid. Figure 3.1 gives a sketch. The solution 
of (3.1) or (3.4) is approximated in æ € G, resulting in a vertez-centered 
discretization. 


Finite difference discretization 
Forward and backward difference operators A, and Va are defined by 


Aa; E (Yjtea — Pi)/has Vapi = (Pi — Pi-ea)/Ra , 
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Fig. 3.1. Vertex-centered grid. (ə grid points; - - - finite volume boundaries.) 


where e} = (1,0) , e2 = (0,1). Of course, the summation convention does 
not apply here. Finite difference approrimations of (3.1) are obtained by 
replacing 0/Orq by Ag or Va or a linear combination of the two. 


We mention a few possibilities. A nice symmetric formula is 
1 1 
—51V 5 (dap Aa) + Ap(dasVa)}y + 5(Vat Aa)(bay) +ep =q. (3.20) 


The finite difference scheme (3.20) relates p; to p in the neighbouring grid 
points @;+¢,, ®jte,ze,- This set of grid points together with æ; is called the 
stencil of (3.20). It is depicted in Fig. 3.2(a). This stencil is not symmetric. 
The points 2j4¢,e, enter only in the stencil when aj2 Æ 0. The local dis- 
cretization error is O(h}, h2), and so is the global discretization error, if the 
right-hand side of (3.1) is sufficiently smooth, if the boundary conditions are 
suitably implemented, and if aag is continuous. It is left to the reader to write 
down a finite difference approximation with stencil as in Fig. 3.2(b). The av- 
erage of Figures 3.2(a) and 3.2(b) gives 3.2(c), which has the advantage of 
being symmetric. This means that when the solution has a certain symmetry, 
the discrete approximation will also have this symmetry. With Fig. 3.2(a) or 
3.2(b) this will in general be only approximately the case. A disadvantage of 
Fig. 3.2(c) is that the corresponding matrix is less sparse. 


Boundary conditions 


Although elementary, a brief discussion of the implementation of boundary 
conditions is given, because a full discussion with a,2(z) Æ 0 is hard to find 
in the literature. If æ; € ðN and a Dirichlet condition is given, then (3.20) is 
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(a) (b) (c) 


Fig. 3.2. Discretization stencils. 


not used in æ;, but we write y; = f with f the given value. The treatment 
of a Neumann condition is more involved. Suppose we have the following 
Neumann condition: 


alap, a(l, £2) = f (x2) (3.21) 


(in physical applications (3.21) is more common than the usual Neumann 
condition ¢ ı = f, and (3.21) is somewhat easier to implement numerically). 


Let æ; lie on zı = 1. Equation (3.20) is written down in x;. This involves 
pj values in points outside G (virtual points). By means of (3.21) the virtual 
values are eliminated, as follows. First the virtual values arising from the 
second-order term are discussed. Let us write 


1 _ _ _ 
—91V 5 (aap Aa) + Ag(@apVa)}¥j = q; Pje +g; * Pjter-ez +q; ‘pie: 


1 2 3 
+9303 +95 Piter +4; Pj-ertes +95 Piter 1 


(3.22) 
with 

g7? = —(a22,j-e, + a22,;)/2h2 — (a12,j-e, + 812,4)/2hıh2 , 
q7? = (a12,j-ez + 012,j+e1)/2hıh2 , 
q7 = —(a11,j;-e, + @11,j)/2h} — (@12,j-e, +.412,3)/2hihe , (3.23) 

B= Gees GGe G E Gje: 

TS 

m#0 


By Taylor expansion one finds that approximately 


-47 (Yi T Piza) + 95 (Pite: ~ 3) — (pj - Pjertea) 
+4; (Pj+ei-ez 7 pj) = 4 MUaP,a(z;) = acd (#2) . 


This equation is used to eliminate the virtual values from (3.22). The first 
order term (b;y),1 is discretized as follows at æ = æ;: 


(bip) 1 = b11¢9 + bip = bip + bi (f — a129,2)/an , 
and 2 is replaced by $(Az + V2)y;- 
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Finite volume discretization 


For smooth aqg(#) there is little difference between finite difference and finite 
volume discretization, but for discontinuous agg it is more natural to use 
finite volume discretization, because this uses the weak formulation (3.4), 
and because it is more accurate, as we saw in the preceding section. 


The domain £2 is covered by finite volumes or cells §2;, satisfying 
2=|J2;, R NNR; = 4, if<j. 
J 
The boundaries of the finite volumes are the broken lines in Fig. 3.1. Except 
at the boundaries, the grid points æ; are at the centre of 92;. 


The point of departure is the weak formulation (3.4), with a(y, Y) given by 
(3.5). Let y be the characteristic function of 92;: 


y(æ)=0, #€2;, y(æ)=1, ENR. 


The exact solution satisfies the jump condition; thus the integral along T in 
(3.5) can be neglected. One obtains 


alp, V) +H(9,¥) + elp ¥) =- f (dope 0)2d2+ | (bap), ad2+ | evan 


Nj N, £2; 
= f aaspangdr + f bapnadr + f cpd9 = f gar 
Tj T; Ny Nj 
(3.24) 


where we have used the divergence theorem, assuming that agg (æ) is contin- 
uous in §2;, and where I; is the boundary of 9;. We approximate the terms 
in (3.24) separately, as follows: 


[rae lola, [cede ~|Ajles95 

Nj Nj 
where |§2;| is the area of §2;. For the integrals over I} we first discuss 
the integral over the part AB of Ij, with A = æ; + (hi/2,—h2/2) , 
B = æ; + (hi/2,h2/2) ; 2; is assumed not to be adjacent to 092. The fol- 
lowing approximations are made: 


B B 
J bodes = ha(biy)ec ; [eareades = he(@a1¥,a)C ` (3.25) 
A A 


where C is the center of AB: C = æ; + (hi /2,0). The right-hand sides of 
(3.25) have to be approximated further. 
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Continuous coefficients 


First, assume that agg (æ) is continuous. We write 


B 


A 


and 
B 
| expades = hay2(C)(V2ej+e: + 429;)/2 
A 
or 
B 
f apades = heay2(C)(A2 + V2)(~j + ¥jte.)/4 
A 
or 


B 
J ozoda = hoaie(C)(Arpjre, + Vopi)/2. 
A 


Discontinuous coefficients 


B 
f tredan = hzbı(C)(pi + Pj+e1)/2 5 f onpadaz = hari (C) Ay; 
A 


(3.26) 


(3.27) 


(3.28) 


(3.29) 


Assume that a,g(#) is continuous in 9;, but may be discontinuous at the 
boundaries of 92;. In the approximation of the right-hand sides of (3.25), the 


jump condition (3.6) has to be taken into account. 


For simplicity we assume @j2 = 0. Proceeding in the same way as in the 


one-dimensional case in the preceding section we obtain 


B 
[arpades = how; 419; , 
A 


with 


w; = 2a11,j011,j4¢,/(@11,3 + @11,;+e,) - 


The convective term in (3.25) may be approximated as follows: 


ha(biy)o = habi(C) (pj + Piter)/2- 


(3.30) 


(3.31) 


(3.32) 
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If a,; is strongly discontinous it is more accurate to approximate y(C’) as 
follows. According to the jump conditon (3.6), with a12 = 0, we have approx- 
imately 


ai1(#j)(p(C) — pi)/hi = a11(@j4e1)(Pj+er — Y(C))/Ar - 
Solving for y{C) gives 


p(C) = {aii (@;) 9; + G11 (Bj +e.) Pj4e, }/(@11(#j) + @11(Bj+e,)) - 
Substitution in (3.25) results in 


ho(hiy)c = stabi (C)w; {p/a (2j+e,) + pj+e,/a11(#;)} . (3.33) 


The integrals along the other faces of 2; are approximated in a similar fash- 
ion. 


Just as in the one-dimensional example discussed earlier, one may also as- 

sume that aag(æ) is continuous across the boundaries of the finite volumes, 

but may be discontinuous at the solid lines in Fig. 3.1. Then we approximate 

aap(æ) by a constant in each cell bounded by solid lines. The integral over 

AB is split into two parts: over AC and over CB. One obtains, for example, 
c 


[ eoreaden = hafan(A)Ap; + anlApa) (3.34) 
A 


where p 2 has to be approximated further. Now the case aj2(z) Æ 0 is easily 
handled, because the jump conditions do not interfere with the approximation 
of p2. For example, 


p,2 = V2(~j + Yj4e1)/2- 


For the convective term the approximation (3.32) may be used. 


Boundary conditions 


The boundary conditions are treated as follows. If we have a Dirichlet con- 
dition at æ; we simply substitute the given value for y;. Suppose we have a 
Neumann condition, for example at x; = 1: 


aa1P all, £2) = f(x) . (3.35) 
Let AB lie on x; = 1. Then we have 
B 
[enreodes = hof(x2,;) (3.36) 


A 
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and 


B 
J broda = habi 59; . 
A 


Exercise 3.4.1. Derive a discretization using the stencil of Fig. 3.2(b). 
(Hint: only the discretization of the mixed derivative needs to be changed.) 


3.5 Cell-centered discretization 
Cell-centered grid 


The domain 92 is divided in cells as before (solid lines in Fig. 3.1), but now 
the grid points are the centers of the cells, see Fig. 3.3. The computational 
grid G is defined by 


, oo‘, 1 1 
G={rERN: T= £j = (j — p)h, j = (ji, j2) P = (5:5) 


h = (hı, hz), ja = 1,2, Ras ha = l/na} . 


The cell with center æ; is called §2;. Note that in a cell-centered grid there 
are no grid points on the boundary 022. 


DOOCOTEP. 
pote te pete te 
ledede te te 
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Fig. 3.3. Cell-centered grid. {è grid points; —- finite volume boundaries.) 
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Finite difference discretization 


Finite difference discretizations are obtained in the same way as in Sect. 3.4. 
Equation (3.20) can be used. 


Boundary conditions 


Suppose 92; is adjacent to xı = 1. Let a Dirichlet condition be given at 
2,=1: o(1,22) = f(x). Then (3.20) is written down at æj, and @ values 
outside G are eliminated with the Dirichlet condition: 


Piter = 2f (£25) — Pj - 
When we have a Neumann condition at x; = 1 as given in (3.21) then the 
procedure is similar to that in Sect. 3.4. Equation (3.20) is written down 
at 2;. Quantities involving values outside G (virtual values) are eliminated 
with the Neumann condition. Using the notation of equation (3.23), we have 
approximately 


G5 (Yjter — Pi) +95 "(Piter—e2 — $j) * ~A1a¥,a(1, 2)/hi = -f (22)/h: . 


This equation is used to eliminate the virtual values. 


Finite volume discretization 


In the interior, cell-centered finite volume discretization is identical to vertex- 
centered finite volume discretization. When a,g(#) is continuous in S? then 
one obtains equations (3.26)—(3.29). When ags{#) is continuous in £2; but 
is allowed to be discontinuous at the boundaries of Q; then one obtains 
equations (3.30)—(3.33). We require a12(z) = 0 in this case. When aag(æ) is 
allowed to be discontinuous only at line segments connecting cell centers in 
Fig. 3.3, then one obtains equation (3.34). 


Boundary conditions 


Because now there are no grid points on the boundary, the treatment of 
boundary conditions is different from the vertex-centered case. 


Let the face AB of the finite volume £2; lie at zı = 1. If we have a Dirichlet 
condition (1, 22) = f(r2) then we put 
B 


[ eroPade: ~ 2hza11,; (F(C) — pi)/hı + hzaizdf (C) /dz2 
A 
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and 


B 
f tredan = heb (C) f (C) 3 
A 


where C is the midpoint of AB. 


If a Neumann condition (3.35) is given at zı = 1 then we use (3.36) and 


B 


f bipdzz = hybi(C)p(C) , (3.37) 
A 


where p(C) has to be approximated further. With upwind differencing, to be 
discussed shortly, this is easy. Higher order accuracy can be obtained with 


1 
(C) = pj + 5fie(C) 


where the trick is to find a simple approximation to p (C). If aag(x) is 
continuous in §2;, then we can put, using (3.35), 


ya(C) = {F(C} - aie 2(C)}/ar13 
~ {f(C) - 5012,5(V2 + A2) 9; }/a11,; - 


If aag(æ) is discontinuous at lines connecting finite volume centers (grid 
points) then (3.35) gives 


at, ;¥,1 (C) + ats ;p%(C) = f(C), 
at, 2,1 (C) + aiz 59 (C) = f(C), 


where the superscripts 1 and 2 indicate lim te,(c) and lim,,)2,(c), respec- 
tively. Taking the average of the preceding two equations and approximating 
Y,2 one obtains 


yi(C)= S4C) a at, Arp; }/a5,; + strc) = aia V29; }/411,; . 


3.6 Upwind discretization 
The mesh Péclet number condition 


Assume aı2 = 0. Write the discretization obtained in the interior of 2 with 
one of the methods just discussed as 


~3 -1 0 
q; Pj-ez +q; Pj-e, +q; Pj + J Pje + d$ Pjtez = fj. 
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As will be discussed later, for the matrix A of the resulting linear system to 
have the desirable property of being an M- matriz, it is necessary that 

q; <0, v= 4l, £3. (3.38) 


Let us see whether this is the case. First, take aag(æ) and ba(æ) constant. 
Then, apart from a scaling factor, all discretization methods discussed lead 
in the interior of {2 to 


1 1 
qr? ~ a habe — hya22/h2 , qj" = ~ 5 habs —heaii/h1 , 


1 1 
q} = 5 habs — hzāa11Ř1ı , q; = zb — hiaz2/h2 , q? = -Ņ) g - 


v#0 


From (3.38) it follows that the mesh Péclet numbers Pa, defined as 
Py = |balha/daa (no summation) 
must satisfy 
Pa <2. (3.39) 


With variable aag(æ) and b,(#) the expressions for q7 become more com- 
plicated. Let us take, for example, cell-centered finite volume discretization, 
with a,,(#) continuous inside the finite volumes, but possibly discontinuous 
at their boundaries. Then one obtains 
q7? = —hibz j-e2/20j-e3/2022,j-e3 — h1Vj-e2/ħ2 , 
q5° = —hoby j~0,/2Wj—e,/2011,;-e, ~ hewj—e,/h1 , 
q; = hobi j4e,s2wj/2a11,3 ~ how; /hr , 
q; = hybe j4e,0;/2G22,; — hyvj/he , 
q} = hyvj—e,/h2 + howj—e,/hi + hivj/he 
+ habi j4e,/2W;j/2011, j4e, — hobij—e,/2Wj—e,/(2011,3-e1) 
+ hide 5 46,/20; [2422 j +6. — hibz 5; ~05/20j—e2/(2022,;-e2) , 


where w; is defined by (3.31), and vj = 2a92,;422,;4¢,/(@22,j + 422,j+e2): 
Again, for A to be an M-matrix, equation (3.39) must satisfied, with Pa 
replaced by P,,;, defined by 


Pa, = lba jtes/2|ha/daaj (no summation) . 


Upwind discretization 


In computational fluid dynamics applications, often (3.39) is not satisfied. 
In order to have an M-matrix, the first derivatives in the equation may be 
discretized differently, namely by upwind discretization. This generates only 
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non-positive contributions to q? v Æ 0. 


First we describe the concept of flux splitting. The convective fluxes b,@ are 
split according to 


bop=fi+ fz- 


First-order upwind discretization is obtained by the following splitting: 


1 
fă = 5 (boy  lbaly) , 
and by the following finite difference approximation: 
(bap) a = Vast + Aaf. (3.40) 


In the finite volume context, upwind discretization is obtained with (cf. (3.26), 
(3.32) and (3.37)) 


B 
[bodes = hol fy, + Si j4e,) . (3.41) 
A 


Upwind discretization reduces the truncation error to O(h,). Much has been 
written in the computational fluid dynamics litarature about the pros and 
cons of upwind discretization. We will go into this later. The interested reader 
may consult Roache (1972) or Gresho and Lee (1981). 


The mixed derivative 


When ai2(z#) Æ 0, condition (3.38) may be violated, even when Pa = 0. In 
practice, however, usually aj2(a#) Æ 0 does not cause the matrix A to deviate 
much from the M-matrix property, so that the behavior of the numerical 
solution methods applied is not seriously affected. See Mitchell and Griffiths 
(1994) and Exercise 3.6.1 for a discretization of the mixed derivative that 
leaves (3.38) intact. 


Boundary conditions 


Upwind discretization makes the application of boundary conditions easier 
than before, provided we have the physically common situation of a Dirichlet 
condition at an inflow boundary (bana < 0 with n the outward normal on 


A). 


In the vertex-centered case, if x; = 1 is an inflow boundary, the Dirichlet 
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condition is applied directly, and (3.41) is not required. If zı = 1 is an 
outflow boundary (bı > 0), (3.40) gives 


(iy)ix Vifi, (3.42) 
whereas (3.41) becomes 
B 
J bivaes ~ hoff; ; (3.43) 
A 


so that no virtual values need to be evaluated. In the cell-centered case with 
finite differences, if z} = 1 is an inflow boundary, a suitable approximation 
at this boundary is 


(bip), 1 = 2(b1(1, x2)g(£2) — bij) /hi , 


with g(r2) the prescribed Dirichlet value, whereas in the outflow case we have 
(3.42). With finite volumes we have in the case of inflow 


B 


f troda ~ hz2bı(1, t2)g(x2) 
A 


and equation (3.43) in the case of outflow. 


Exercise 3.6.1. Show that in order to satisfy (3.38) in the case that aiz Æ 0 
one should use the seven-point stencil of Fig. 3.2(a) if a12 < 0 and the stencil 
of Fig. 3.2(b) if aı2 > 0 (cf. Exercise 3.4.1). Assume aag = constant and 
b, = c = 0, and determine conditions that should be satisfied by aj. for 
(3.38) to hold; compare these with (2.4). 
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We now turn to nonuniform grids. We consider only the one-dimensional 
case, because much can be learned from it with little effort. Two-dimensional 
applications will be studied in the next chapter. 


Difference between finite volume and finite difference discretization 


Let us consider the convection-diffusion equation with constant coefficients: 


d? d 
Lp =- tu =g, (3.44) 
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Fig. 3.4. A nonuniform grid. 


Figure 3.4 shows a nonuniform grid. There is no need to consider the bound- 
aries. Central finite difference discretization is given by 


Lg; E= —D{(pj41 — pi)/hi+i — (pj - yj-1)/hy Y hi+1/2 
+ ulyj41 — Pj-1)/(hj + hj 41) 
= pj-1(—D/h; — u/2)/hj 41/2 + 29; D/ (Aj hj 41) 
+ 93 41(—D/hjy1 + u/2)/hj41/2 = 4; | 


where hy4i/2 = (h; -+ h;+1)/2, and h; = Ai; -ijz cf. Fig. 3.4. 


With vertex-centered finite volume discretization the volume boundaries are 
placed midway between the nodes. We write 


[bode = PEHI = hasan 
N, 
with 
F=F"4F°, F” =-—Ddọ/dr, F° = ugo. 


We will call F the flux, F” the viscous flux and F° the convective flux. Central 
approximation gives 


1 
Frei = —D(vj41 — pi)/hj+1 , Fr saya = ztl + Pj+1) 


The resulting discrete approximation is 
Lipi = hi41/24; , f= hjzisoLly . (3.45) 


Hence, finite difference discretization and vertex-centered finite volume dis- 
cretization are equivalent. 


With cell-centered finite volume discretization the domain is first divided into 
cells, and the nodes are placed in the cell centers, as shown in Fig. 3.4. Let H; 
be the size of the cell with center at 2;. The cell-centered flux approximations 
are 
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č [a 1 
= -D(pj41 — Pi)/Hj+i23 Fh yaya = gules + pier)» (3-46) 
with Hy41/2 = (Hj + Hj41)/2- The following discrete approximation results: 


Lepi = j-1(—D/Hj-1/2 — u/2) + (3.47) 
pj D(1/ Hy —1y2 + 1/ Aj 41/2) + 95 41(—D/ Aj 41/2 + u/2) = Hiqi - 


If the grid is uniform the three discretizations Lj, Ly and L. are identical, 
but on non-uniform grids Le is different from L, and Ly, since Hj_~1/2 = hj, 
but H; Æ hj41/2, cf. Fig. 3.4. This results in a striking difference in local 
truncation error, as we shail now see. 


Definition 3.7.1. The local truncation error of the discrete operator La is 


Tj = La{elzi)- ej}, FHl end, (3.48) 
with y(x) the exact solution. 


Definition 3.7.2. Consistency 
If L is the differential operator to be approximated, and Lap; approximates 
h? Lip(xj), then La is called consistent if 


lim 75 /hj = 0, j=l,- J, A= max{h;: j=l, J} 


This implies that the size of r is measured in the maximum norm. If we 
use a weaker norm, such as proposed for example in Spijker (1971), then 
an inconsistent (by our definition) scheme may get reclassified as consistent. 
In fact, this is the case for the inconsistent schemes that we will encounter. 
But we will stick for simplicity to the above definition. As a consequence, an 
inconsistent scheme may be convergent, as will be seen. 


By Taylor expansion we find for the vertex-centered scheme (3.45), denoting 
the local truncation error by r": 


1 1 
— 3D) + zue” igs apalhjes — h;) + O(A®), (3.49) 


where p(®) = d%p(x;)/de*, a= 2,3, and 


rt = ( 
A, = max{hj: j=1,..,J7}, 


with J number of nodes. The local truncation error 7° of the cell-centered 
scheme (3.47) is found to be 


C l 1 
r = (—ZDe + Fup) (Hjsa ~ 2H; + Hy) (3.50) 
1 1 
+- gP” + ue) Aaa j2 = H? j2) + O(A?) ; 
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where 
A, = max{ H; : j=1,..,J-1}, 
with J — 1 the number of cells. 
The grids are called smooth if 
|Aj41 — hil = O(A) 
in the vertex-centered case, and 
|Hj41j2~ Hj-1/2| = O(A) and |Hj41 — 2H; + Hy-1| = O(Ae) 


in the cell-centered case. On smooth grids, 


r= O(A*) (3.51) 
in both cases. But on arbitrary grids 
r? = O(Ai), r =O(A-). (3.52) 


We see that the cell-centered scheme is not even consistent. However, it 
would be wrong to think on the basis of (3.51) and (3.52) that the accu- 
racy on smooth grids is necessarily better than on arbitrary grids, or that 
vertex-centered discretization is more accurate than cell-centered discretiza- 
tion. This follows from a study of the error, to the presented below. As noted 
earlier, the seemingly inaccurate cell-centered finite volume discretization is 
in widespread use, also on nonsmooth grids, and is known as block-centered 
discretization in reservoir engineering (Aziz and Settari (1979)). Fortunately, 
it is not necessary to advise against its use because of its large local truncation 
error, even though it is not consistent. 


Global truncation error 


Definition 3.7.3. Global truncation error 
The global truncation error is defined as 


ej = 9(2j) — 9; 
with v(x) the exact solution. 
According to (3.48) the local and global truncation errors are related by 
Lge; = 7; . (3.53) 


Estimates of e; are given in Tikhonov and Samarskii (1963), Manteuffel and 
White, Jr. (1986), Forsyth, Jr. and Sammon (1988) and Weiser and Wheeler 
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(1988). Both in the vertex-centered case (3.45) and in the cell-centered case 
(3.47) we have 


ej = O(A?). (3.54) 


Because this satisfactory accuracy result is the most surprising for the in- 
consistent, cell-centered scheme, we will partially sketch a proof of (3.54) for 
this case only, following the theory developed in Forsyth, Jr. and Sammon 
(1988) for the two-dimensional diffusion equation. We will include a convec- 
tion term, but restrict ourselves for simplicity to the one-dimensional constant 
coefficient case. 


Alternative discretizations of the convection term 


Before sketching a proof of (3.54) we introduce two additional discretizations 
of the convection term. One may try to improve accuracy on nonsmooth grids 
by replacing (3.46) by linear interpolation for the convective flux: 


Fiyi = (Hipis + Hipgpi)/ (H; + Hja) - (3.55) 

A third possibility is the following finite difference approximation (studied 

in Veldman and Rinzema (1992) for the vertex-centered case), obtained by 
fitting a parabola through p;j-1, Pj, Pj+1: 

uH; Hj-1/2 

Ay _ajat+ Hj41/2 Hj41/2 

„Hiru? 

H;-1/2 


Hyudy(z;)/dx = 





(Pi41— pi) (3-56) 


(pj ~ pj-1)} - 


The contributions of these approximations of the convection term to the local 
turncation error 7° are 


1 


gH (Hits -— H;j—ı)ud’p(z;)/dz? + O(A2) . (3.57) 


for (3.55) and O(A3) for (3.56). As expected, these discretizations have a 
smaller local truncation error than (3.46). On uniform grids the three dis- 
cretizations are identical. 


Estimate of global truncation error for cell-centered scheme 


The validity of (3.54) can be made plaussible as follows. Define the following 
grid functions: 
j j 
HEH}, p=} Hiz, Hj =X (H+ Hg)Hs-1/2, (3.58) 
k=1 k=1 
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where Ho = 0. We find with Le defined by (3.47) and yx(x), k = 1, 2,3 
smooth functions to be chosen later: 


Le(di(z;)uj) = Dir (2;)(-2Hj41 + 4H; — 2Hj-1) + (3.59) 
dyl z; l 
+ (pE ou (25) MH} — H3,1) + (49), 
Le (a(e;)u3) = Daa(aj)(AG 12 — Higi) + O(A2) (3.60) 
L.(b3(zj)u3) = Dus(2;)(H}_1 — H},1) + O(A2) . (3.61) 
We choose 
1 u 1 u 
2) #0) — tol) (2) 
ae pe i sv ip? 
__oyr Ue 
v3 de 3D” 
and define 
ef =Ue(aj)u;, k= 1,2,3. (3.62) 


Remembering (3.53), comparison of (3.59)—(3.61) with (3.50) shows that 


L.(e; —e; —e; — e) = O(A®). (3.63) 
The right-hand side is of the same order as the local truncation error in the 
uniform grid case, which makes it plausible that 


€j — ej — e; — es = O(A?). (3.64) 


Since ef = O(4Ž) , k = 1,2,3 we find 
e; = O(A?). (3.65) 
which is what we wanted to show. 


Of course, for a complete proof the boundary conditions have to be taken 
into account. Furthermore, it has to be shown that (3.63) gives (3.64). This 
can be done using the maximum principle (to be discussed later) but we will 
not do this. The maximum principle holds if the discretization matrix is an 
M-matrix. If it is not, (3.64) becomes doubtful. In the present case we do 
have an M-matrix, provided the mesh Péclet number is small enough. This 
will be more fully discussed in the next chapter. 


Complete proofs of (3.65) are given in Manteuffel and White, Jr. (1986) 
(for vertex-centered discretizations, and for a slightly different cell-centered 
discretization) and in Forsyth, Jr. and Sammon (1988) (for D variable, but 
u = 0); in the same situation second order convergence in the discrete /7-norm 
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is proved in Weiser and Wheeler (1988). Hence, the inconsistent. scheme de- 
fined by (3.47) has second order convergence on arbitrary grids, so that its 
widespread application is justified. 


A similar method can be applied to the other two discretizations of the con- 
vection term. For (3.55), with associated contribution to the local truncation 
error 7° given by (3.57), we choose ut, u? and p° as in (3.58), and 


1 1 
yı = zp” ; pa = ze” ’ P3 = -dy /dz . 


With e" acording to (3.62) we again have (3.64). This holds also for (3.56). 


Spectro-consistent discretization scheme 


In the continuous case convection corresponds to a skew-symmetric operator, 
whereas diffusion corresponds to a symmetrie semi-positive definite operator. 
In Verstappen and Veldman (1998) it is shown that it is beneficial for accuracy 
if these properties are shared by the discretization scheme. Furthermore, in 
the nonstationary case the scheme is automatically stable in the energy norm 
if the symmetric part of the spatial discretization operator is semi-positive 
definite. In Verstappen and Veldman (1998) this is called spectro-consistent 
discretization. We will check these properties. 


The discrete scheme leads to a linear system that we may denote as 
(B + C)en =b ' 


with B the diffusion matrix and C the convection matrix. The cell-centered 
scheme (3.47) gives 











e e e 
_ __1 1 1 __1 
B=D H,-1/2 HĦHji-1/2 t H,41/2 H,41/2 , 
e e e 

u e e e 
C = = -1 01 

2 

ees 


The conditions for a spectro-consistent scheme are obviously satisfied. How- 
ever, for the seemingly more accurate discretization of the convection term 
(3.55) we obtain 











e n e 
p H, H, Hy-2 H, 
C= — H,-3/2 H, -1/2 H,-3/2 H,-1/2 
2 — 3 A434 Ay-1 H, 
H,-1/2 H4172  H;-1/2 H4172 
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which is clearly not skew-symmetric. Furthermore, the generic row of (C + 
CT) /2 is given by 


te H;-ı— H; Hjisı _ Hj- Hj — Ht, 

2 2Hj-1/2 Hjşız2 Hj-1/2 2H;j41/2 

so that the symmetric part of B + C is not necessarily semi-positive definite. 
Hence, with (3.55) we do not have a spectro-consistent scheme, which spells 
trouble, as shown by examples presented in Verstappen and Veldman (1998). 


Discussion 


Despite the differences in local truncation error on nonsmooth grids for the 
three discretizations of the convection term (3.46), (3.55) and (3.56), we 
have made it plausible that in all three cases the global truncation error 
e; = O(A?), regardless of grid smoothness. Hence, there is no reason to 
think that (3.55) is more accurate than (3.46), or that (3.56) is more accu- 
rate than (3.55). In fact, the opposite may be true on nonsmooth grids. If 
H; ~1/2/Hj41/2 > 1 then (3.56) approximates backward differencing, which 
is likely to cause numerical wiggles when u > 0. This is confirmed by numer- 
ical experiments with (3.56) on a vertex-centered grid reported in Veldman 
and Rinzema (1992). When H;/Hj41 >> 1 and H;-1/H; > 1 then (3.55) 
also approximates backward differencing. 


Another indication of possible trouble is the fact that (3.55) and (3.56) may 
make a negative contribution to the main diagonal of the resulting discretiza- 
tion matrix, i.e. to the coefficient of y;. As a result, the discretization matrix 
is not an M-matrix, and (3.63) does not give (3.64). Our conclusion is, that 
the inconsistent scheme (3.47) is perfectly all right, and that the derivation 
of more accurate schemes should be approached with caution on nonsmooth 
grids, because making the local truncation error smaller may make the global 
truncation error larger. That this may happen is easy to see. According to 


(3.53) 
ej = Litr. 


A change in La that makes |r;| smaller but that makes ||Z7"|| larger may 
make {e;| larger. Of course, the choice of norm is important here. 


Finally, we have shown that (3.55) does not lead to a spectro-consistent 
scheme. This can also be shown for (3.56). 
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Accuracy at boundaries 


Let a Dirichlet condition be given at z = 0: 


g(0)=f. 
For simplicity we assume u = 0 and D = 1. The vertex-centered scheme 


(3.45) becomes at the boundary 


1 1 1 
Leg. = -— —+—)=h hy, 
pı hf? + Pil + 7) 3/241 + f/h1 


with local truncation error rj given by (3.49), as in the interior. 


For the cell-centered scheme (3.47) this boundary condition is implemented 
as described in Sect. 3.5, resulting in 














2 1 2 
c21 =— —)= H 2f/Hı, 3.66 
Lepi Hypa + Pil + Hy 191 + 2f/H1ı (3.66) 
with local truncation error 
«1 dy(ti) 1,12 p2 Eele) 3 

Ti = zH -+ 2H) dr? gami — 3/2) dr? + O(A?) ` (3.67) 

In the case of a Neumann condition 

—dy(0)/dx =f, 


the two schemes become at the boundary, following Sections 3.4 and 3.5: 


1 
Ly yo = — (p1 — po)/hi = z140 +f, 
Levi = — (p2 — pı)/ H32 = Hiq + f, 


with local truncation errors in the grid-point zı, respectively, 











1 ,d3y(0 
p = -in L ofa), 
e_ 1l d'elz) lay 2 4° p(x1) 
Ti = (fi — Aa) det - zH + 4H3/2) dei (3.68) 
+0(43). (3.69) 


We make the following observations. On smooth grids with the Dirichlet 
boundary condition we see that 


ri =0(4), ti =O(A), 


so that even on uniform grids the cell-centered scheme is not consistent at 
the boundary! However, as before, this is no reason to prefer vertex-centered 
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over cell-centered discretization. Nor is it necessary to improve the accuracy 
of the cell-centered (or, on non-smooth grids, also the vertex-centered) im- 
plementation of the Dirichlet condition, because the global truncation error 
is O(A*) in all cases (see Manteuffel and White, Jr. (1986)) and Forsyth, Jr. 
and Sammon (1988)). 


We can make this plausible by a kind of backward error analysis, as follows. 
Suppose we use a perturbed boundary value f in the cell-centered scheme 
(3.66). Then the local truncation error is found to be 


2 
FE pË a n _ 
157 H, (f—f), 
with rfgiven by (3.67). Hence, we approximate the exact solution of the 
differential equation with a perturbed boundary value 


` Hı. 
faftst 


with zero local truncation error at the boundary, so that we expect the numer- 
ical solution to be close to the perturbed exact solution. Since f = f+0(A?) 
we expect the perturbation of the exact solution to be O(A?), and the global 
truncation error also to be O(A?). 


On smooth grids with the Neumann condition, 
ri =O(Al), ry = O(A2), 


which is an order lower than in the interior. Again, this is no reason to 
improve the accuracy of the discretization near the boundary, because even 
on nonsmooth grids we have (3.54), as shown in Tikhonov and Samarskii 
(1963), Manteuffel and White, Jr. (1986), Forsyth, Jr. and Sammon (1988) 
and Weiser and Wheeler (1988). 


Final remarks 


We conclude that. both the vertex- and the cell-centered schemes discussed 
with straightforward implementation of boundary conditions are second order 
accurate. This is true not only for the one-dimensional convection-diffusion 
equation (3.44), (to which we have restricted ourselves here to diminish tech- 
nical details), but more generally. Second order accuracy is shown for the 
one-dimensional convection-diffusion equation with variable coefficients in 
Manteuffel and White, Jr. (1986). The two-dimensional diffusion equation 


~ (aP a), a =q 


with smooth diffusion coefficient a(x) is fully discussed in Forsyth, Jr. and 
Sammon (1988) and Weiser and Wheeler (1988). The case of discontinuous 
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a(x) (important for flow in porous media and reservoir engineering) is in- 
cluded in Tikhonov and Samarskii (1963). 


In the next chapter we treat the two-dimensional convection-diffusion equa- 
tion, including singular perturbation aspects. Our main purpose will be to 
make it plausible, that with suitable local mesh refinement it is possible to 
compute flows at high Reynolds number accurately, despite the fact that the 
local truncation error (causing numerical viscosity) seems large compared to 
the viscous forces. 


Exercise 3.7.1. (cf. Veldman and Rinzema (1992)). Perform numerical ex- 
periments for the following special case of (3.44): 


with exact solution 
p(x) = (e? —1)/(e/? -1). 


Use the cell-centered finite volume method. Choose D = 107? and 107°. 
Choose m cells of equal size in (1— 5D, 1) and vary m. Compare the accuracy 
of the three convection term discretizations (3.47), (3.55) and (3.56). 


Exercise 3.7.2. Show that (3.56) does not lead to a spectro-consistent 
scheme. 


4. The stationary convection-diffusion equation 


4.1 Introduction 


The convection-diffusion equation has been derived in Sect. 1.11. We take 
p = 1, and obtain 


+ tava — (Dpa)a =q, BENCRY, O<t<T. (4.1) 


For the physical significance of the terms in this equation, see Sect. 1.11. 
The equation is assumed to be linear, with y the only unknown. A num- 
ber of important aspects of the numerical analysis of the equations of fluid 
dynamics show up in this simple equation. Its simplicity allows a thorough 
analysis of these aspects, which will be given in this chapter. Readers who 
have some experience in computational fluid dynamics may think at first sight 
that our treatment of such a simple linear equation is too detailed, but it is 
a fact that sometimes controversial and not always well understood impor- 
tant issues, notably the occurrence of numerical ‘wiggles’, the specification of 
outflow boundary conditions, singular perturbation aspects (the occurrence 
of boundary layers when D < 1, in a sense to be made precise shortly, a 
common situation in fluid dynamics) and the role of false (numerical) viscos- 
ity, can be brought out and clarified completely in the context of this simple 
equation. 


Dimensionless form 


Let L and U be typical] length and velocity scales, and let & be a typical scale 
for p. Define dimensionless variables according to 


l 


p = p/ð, u, =ua/U, t =tU/L, r, = falL, 


q = qL/(U8). 


Then (4.1) takes the following dimensionless form, upon deleting the primes: 


Op _ 
Ot + Yaa ~ (Pe “p.a),a =q, 
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where 
Pe=UL/D (4.2) 


is the Péclet number, already encountered in Sect. 1.11. 


Initial and boundary conditions 


At time t = 0 the following initial condition is given: 
p(0,æ) = polz), EN. 


At every point of the boundary 092 of N we have precisely one of the following 
types of boundary condition: 


Dirichlet: g(t,a#)=f(t,z), 2EdOQ, I<t<T, 


Neumann: PČELE L fita), æ E€ 0N, 0<t<T, 
n 


t 
Robin: PČELE | ap = flt,2), EN, 0<t<T, 


with a > 0 and n the outward unit normal on 022. 


Classification 


According to Sect. 2.2, equation (4.1) is parabolic; if D = 0 it is hyperbolic. 
If 0 < D < 1, or more correctly, if Pe >> 1, hyperbolic aspects emerge, al- 
though according to the classification scheme the equation is parabolic. A sort 
of mixture of parabolic and hyperbolic behavior is typical of the convection- 
diffusion equation, and more generally, of the equations of fluid dynamics. 
This is why typical numerical methods for parabolic methods are less suit- 
able for the convection-diffusion equation with Pe >> 1. Exploration of the 
‘no-man’s land’ between the parabolic and hyperbolic realms is one of the 
aims of the present chapter. 


The stationary case is obtained by deleting the time-derivative. In the sta- 
tionary case the foregoing remarks hold with the word parabolic replaced by 
the word elliptic. 


The consequences of Pe > 1 are so far-reaching that this chapter is not just 
a completion of the preceding one. 


Exercise 4.1.1. Show that (4.1) is parabolic. 
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4.2 Finite volume discretization of the stationary 
convection-diffusion equation in one dimension 


The conservation form 


The stationary (time-independent) version of the convection-diffusion equa- 
tion (4.1) is 


Lo = (uap), a — (De,a),a =q, EN. (4.3) 


Here we have replaced uap, a by (uap), a, assuming Ua, a = 0 (divergence-free 
velocity field), so that (4.3) is in conservation form. An equation Ly = q is 
said to be in conservation form if f Lyd? reduces to an integral over 022. 


n 
This is the case here, since with Theorem 1.2.1 we find 


J tie = fiae — Dp,a)nadS . (4.4) 


2 an 


The terminology ‘conservation form’ derives from the fact that in the non- 
stationary case, given by 0y/dt + Ly = q, we have that f yd? is constant 
A , 


(conserved) if there is no production (q = 0) and no transport through the 
boundary 02 (right-hand side of (4.4) equal to zero). 


One-dimensional case, central approximation 


We now restrict ourselves to one dimension. As a result everything becomes 
very simple. This should not tempt the reader, however, to skip this part. 
Because the one-dimensional case is so simple we can be perfectly clear about 
important issues about which one often encounters somewhat foggy notions, 
and which are harder to settle in more dimensions because of more intricate 
technicalities involved. In general we will not repeat the arguments to be 
presented when we go to higher dimensions later. The issues we have in mind 
are treatment and accuracy of outflow boundary conditions, how to handle 
numerical wiggles, upwind discretization and artificial viscosity, and how to 
balance accuracy and low order wiggle-free discretization. Our exhaustive 
treatment of the one-dimensional case will allow us to be brief about the 
higher-dimensional case later. 


We rewrite (4.3) as, replacing u1 by u, 


Let the following boundary conditions be given: 
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p(0) =a, dy(1)/dz=b. 


Finite volume discretization of (4.5) was discussed in the preceding chap- 
ter, mainly for constant u and D. Extension to variable coefficients is easy. 
We discuss only cell-centered discretization; the vertex-centered case can be 
handled in similar way. An arbitrary cell-centered grid is given in Fig. 4.1. 
As in the preceding chapter, the cell with center x; is called §2;, its size H; 
and the interface between $2; and §2;41 is called z;41/2. With finite volume 


Fig. 4.1. Non-uniform cell-centered grid. 


discretization, equation (4.5) is integrated over §2;, giving 


| teag = Ft = [ean = Hq; , 
N, fy 


with Fett? = Fy4is2— Fj-1/2; Fj41/2 = F(£;41/2), F(x) = uy — Ddy/dz. 


j—-1/2 7 
Often, F(x) is called the flux. The following scheme is obtained: 


Fj41/2 — Fi-1/2 = Hiq; ; j = 1,2, agd. 


Conservative discretization 


Summation of equation (4.6) over all cells results in 


J 
Py41;2 — Fy 72 = S Hig; . 


j=1 


(4.6) 


(4.7) 


We see that only boundary fluxes remain, to that equation (4.7) mimics the 
conservation property (4.4) of the differential equation. Therefore the scheme 
(4.6) is called conservative. This property is generally beneficial for accuracy 


and physical realism. 


Discretization of the flux 


Central discretization of the convection term is done by approximating the 


convective flux in the same way as in Sect. 3.7: 


ol 
(up)j41/2 = Fuj+u2l P + Pit) - 


(4.8) 
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Upwind discretization is given, as in Sect. 3.6, by 


1 1 
(uy)s4iy2 = luja + lujia + 5 luiy — luzpazl) j+. (4-9) 
2 2 


This means that uy is biased in upstream direction, which is why this is 
called upwind discretization. 


We allow D(z) to be discontinuous at cell interfaces, as in Sect. 3.3. The 
discretization of the diffusion term in Sect. 3.3 is easily generalized to non- 
` uniform grids as follows (cf. (3.15)): 


d ra 
» m ta 
lyr = (Do )i+1/2 J pi 


Assuming D to be constant in the cells, we obtain 


L543 
1 1 
| pees Empu Hip = 3H; + Hj), 


with 


wj = 
7 Dj Hj. + Dj41 Hi; 


It follows that the diffusive part of the flux is approximated by 


dip N 
(D7 )i+1/2 = wj(pj+1 — Pi)/Hj+1/2 - (4.10) 


The boundary conditions are implemented as described in Sect. 3.5. 


An example 
We will now discuss the numerical consequences of Pe >> 1 , with Pe defined 


by (4.2). Let u and D be constant and let u = U > 0, q = 0. Equation (4.5) 
can be rewritten as 


deo x € (0,1). (4.11) 


plr) = A+ BetPe , (4.12) 


with A and B constants. 
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For the boundary conditions we consider the following two cases: 


(0) =a, y(l)=6 (4.13) 
and 
(0) =a, do(1)/dz =b. (4.14) 
Equation (4.12) gives 
e(z-1)Pe _ -Pe 
plz)=a+ (b-a) Pe (4.15) 
with boundary condition (4.13) and 
p(t) = a + È (eë -DPe _ e-Pe) (4.16) 


Pe 


with boundary condition (4.14). Of course, these solutions satisfy the maxi- 
mum principle of Theorem 2.4.1, since they are monotone. Furthermore, for 
Pe >> 1 they exhibit the boundary layer behavior predicted in Sect. 2.5. 


Equation (4.16) confirms the physical insight that it is wrong to prescribe 
a Neumann boundary condition at an inflow boundary. This can be seen as 
follows. If x = 1 is an inflow boundary, then Pe < 0. For Pe < —1 and z < 1 
equation (4.16) gives 


~ b -Pe 
t) = =e 
so that p(x) is very sensitive to perturbations in b, which means that the 
problem is ill-posed in a practical sense. 


Next, we study the numerical solution of (4.11). We choose the cell-centered 
grid depicted in Fig. 4.1 uniform with H; = h. We start with boundary 
conditions (4.14). The boundary conditions are implemented as in Sect. 3.5, 
resulting in, using the central discretization (4.8): 


Fi se 
Py41/2 


a — 2Pe~*(y, — a) /h, 
yi + thb- Peth. 


In the interior, 


1 - 
Fyaayn = 5 (ei + pj41) — Pe~ (pj — p)/h . 


Substitution in (4.6) results in 
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6 2 4 
1+-)+ l—-j)=a(-4 2), 
pıl z) y2{ > G ) 
2 u 2 
—y;-3(114+ -)+ -g; + gja1(1--)=0, j= 2,3,...,J-1, 
Pj 1( PÈ po Pj+ıl PÈ 


2 2 2 
~yy-1(1 + =) + p(l + =) = ba(=- 1), 
ps-i(l+ 2) tpl + 7) = DAG — 1) 


where p = APe is called the mesh Péclet number. The general solution of 
(4.17) is found in the standard way by postulating solutions of the form 
pj = z/, resulting in, if p £ 0, +2: 


pj =A+Bz, z= (2+ p)/(2- p). (4.18) 


The constants A and B follow from the first and third equation of (4.17). 
For p = 0, i.e. Pe = 0, only the diffusion term is left in (4.11), and the discrete 
solution is given by 


yj =~ A+ Bj. 
For p = 2 the solution of (4.17) is given by 
pia, f=1,2,.,J, (4.19) 
and for p = —2 it is not determined. This is another indication that one should 


not apply a Neumann condition at an inflow boundary. Assume Pe > 0, and 
consider only (4.18). From the first and third equation of (4.17) we find 


so that 
bh | 
pi; =a4+—2-7{-14 (1- EJ} , (4.20) 
P 


Next, consider boundary condition (4.13). it is left to the reader to show that 
in this case the discrete equation in grid point j = J becomes 


2 6 4 
—py-1(14+ -) + yu(-14+ -)=b(- -2 
( z ( z) G72 
With this boundary condition we find for p = 0 or p = 2 that the numerical 
solution is given by, respectively, equation (4.18) or (4.19), whereas otherwise 


we find 


zÍ). (4.21) 
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We now discuss the quality of the numerical solutions (4.20) and (4.21). Our 
most important observation is, that if z < 0 then z? alternates in sign, so 
that the numerical solution shows local extrema called ‘wiggles’. However, if 
z > 0 the numerical solution is monotone, like the exact solution. We have 
z > 0 if and only if 


pl <2. (4.22) 


Exact and numerical solutions for boundary conditions (4.13) are compared 
in Fig. 4.2. Clearly, for p > 2 the numerical solution shows wiggles and is 
not acceptable. With boundary conditions (4.14) (Neumann at outflow) and 


Pe = 10, h = 1/12, Dirichlet Pe = 40, h = 1/12, Dirichlet 


kd 








02 04 0.6 0.8 1 o 02 04 06 0.8 1 
Fig. 4.2. Exact solution (-, equation(4.15)) and numerical solution 
(*, equation(4.21)). 

b = 0 the exact and numerical solutions, given by g(x) = a and (4.20), are 


identical. With b = 1 the results of Fig. 4.3 are obtained. Although there is 


Pe = 10, h = 1/12, Neumann Pe = 40, h = 1/12, Neumann 


È 











0.19 

0.185 
0 02 04 0.6 0.8 1 9.195 02 04 0.6 0.6 1 
Fig. 4.3. Exact solution (-, equation(4.16)) and numerical solution 


(*, equation (4.20)). 


less variation in the boundary layer (note the difference in scale) the relative 
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size of the numerical wiggles for Pe = 40 is just as bad as in Fig. 4.2. 


In practical situations often Pe is so large, that it is not feasible to make A 
small enough to satisfy (4.22). However, this is not always a problem, since 
by one of three measures wiggles may be largely or completely avoided. These 
measures are: 


(i) Apply a homogeneous (b = 0) Neumann boundary condition at outflow; 

(ii) Change the discretization such that the discrete system satisfies a similar 
maximum principle as the differential equation; 

(iii) Apply local mesh refinement in the boundary layer. 


Measure (i) has just been illustrated: the global discretization error is zero in 
the present simple case. Although this is not true in more complicated multi- 
dimensional cases, with this outflow boundary condition wiggles are usually 
absent. Measure (ii) will be discussed later. Measure (iii) is numerically illus- 
trated in the next section. 


Wiggles and stability 


Schemes that allow wiggles are sometimes called instable. But this is not 
correct, as we will now show; an extensive discussion of stability will be given 
later. Stability always refers to the influence of perturbations of the data 
(boundary and initial conditions and right-hand side). Consider the numerical 
solution (4.21). Let the boundary value a be perturbed by an amount da. The 
ensuing perturbation of the numerical solution dy is given by 


õp; = ba(z7 — 2) + 24 p/2)/(z7 —1). 


The numerical solution is stable in the norm ||.|| with respect to perturbation 
of a if there exists a constant C independent of the mesh size h = 1/J such 
that 


lSp]] < Côa. (4.23) 
Let z < 0, so that we have wiggles, and let z < —1. Then 
l5ellco < 6a(2 + lpl/2)l217/127 — 1, 
so that (4.23) holds with C = 2(2 + |p|/2), since 


|jz7 —1]41 jz]? 
e7 
The case 0 > z > —1 and the case of perturbation of b is left to the reader. 


This shows that the numerical scheme, although it generates wiggles, is nev- 
ertheless stable. 


2>1+1/|z7-1|= 


120 4, The stationary convection-diffusion equation 


Exercise 4.2.1. Devise alternative ways to implement the boundary condi- 
tion dp(1)/dx = b for 


dup d_ dp, | 

Tz da ae) 9 (4.24) 
on a cell-centered grid, and also on a vertex-centered grid, with global trun- 
cation error O(h?). 


Exercise 4.2.2. We study the accurate implementation of a Neumann bound- 
ary condition for variable coefficient u. Discretize (4.24) with 


on the grid of Fig. 4.1, with boundary conditions 
y(0) =a, dy(1)/dr=b. 
Can you make at the Neumann boundary 
Ty = O(h?) 


by a O(h?) perturbation of the boundary data? Investigate the accuracy by 
numerical experiments. Take u = 14 csin nrz, c = 0.7, n=1,a=1, b=2, 
q=0, D=10-* and D=1. 


Exercise 4.2.3. Prove that the scheme (4.17) is stable by using its exact 
solution. 


4.3 Numerical experiments on locally refined 
one-dimensional grid 


Choice of grid 


We want to apply local grid refinement in the boundary layer, such that the 
mesh Péclet number p satisfies (4.22) locally in the boundary layer. We will 
solve (4.11) numerically. Boundary layer theory tells us there is a boundary 
layer at x = 1 with thickness ô = O(Pe~'). After some trial and error we 
put ô = 4/Pe. The grid is again cell-centered. In the boundary layer the cell- 
size H; = constant, and we put 1/3 of the total number (J) of cells in the 
boundary layer, hence H; = 36/J. The remaining cells also have the same size 
(H; = 3(1—8)/2J). Figure 4.4 gives a qualitative impression of the grid. Note 
that as Pe changes, the grid points move. There is a large jump in mesh size 
at the edge of the boundary layer. As discussed in Sect. 3.7, the cell-centered 
scheme is locally inconsistent in the maximum norm, but is predicted to be 
nevertheless convergent. We will see what happens in practice. 


4.3. Numerical experiments on locally refined one-dimensional grid 121 


$<} 2 OO HO 
Fig. 4.4. Local grid refinement in boundary layer. 


Numerical experiments 


Equation (4.11) is discretized on the grid just defined in the way described 
in the preceding section. Let the boundary conditions be of Dirichlet type 
(4.13) with a = 0.2, b = 1. The exact solution is given by (4.15). Numerical 
results are shown in Fig. 4.5. 


Pe = 40, 12 celts, 4 calls in boundary layer Pe = 100, 12 cells, 4 ceils in boundary layer 





0 02 04 0.6 0.8 1 “0 02 04 0.6 0.8 


Fig. 4.5. Numerical solution with local grid refinement in boundary layer 
(*: numerical solution: -: exact solution.) 


The total number of cells is 12, as in Fig. 4.2. But much better accuracy 
is obtained. Inside the refinement region the mesh Péclet number equals 
p= H;Pe = 12/J =1, satisfying (4.22) for all Pe , so that there can be no 
local extrema in the numerical solution in the refinement region, according to 
principles to be discussed in the next section. Outside the refinement region, 
p= (3Pe — 6)/J, so that p = 4.5 for Pe = 40 and p = 12 for Pe = 100, 
violating (4.22), so that local extrema can occur. But Fig. 4.5 shows that 
the numerical solution is quite acceptable, even for Pe = 100. Wiggles are 
almost absent. In order to eliminate unacceptably large wiggles as those in 
Fig. 4.2, local refinement or changing a boundary condition is preferable to 
switching to an upwind scheme, which results in a numerical boundary layer 
that is much too thick. This illustrates why the advice “look at the wiggles, 
they tell you something” is given in Gresho and Lee (1981). Wiggles are a 
signal that there are boundary layers that are not resolved, and that. local 
grid refinement is called for. 


It may happen that the small wiggles in Fig. 4.5 are unacceptable. This may 
be the case when the unknown must remain strictly non-negative for physical 
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or numerical reasons. This occurs in the modeling of turbulent or chemically 
reacting flows. Then one may apply a monotone scheme (to be discussed in 
the next section), for example an upwind scheme, on the locally refined grid of 
Fig. 4.4. An accurate numerical solution is obtained, with correct resolution 
of the boundary layer. 


Accuracy 


The maximum norm of the global truncation error e; = y(z;) — p; has been 
measured: 


llelloo = max{le;|, j= 1,...,J} 


Results are presented in Table 4.1. Apparently, e; = O(J~*), confirming the 
error analysis of Sect. 3.7. Secondly, we see that ||e/|,. is almost independent 
of Pe. This is due to the adaptive (i.e. Pe-dependent) local grid refinement in 
the boundary layer. Of course, for a given J computing work and storage are 


Pea a m0 
2 24 48 2 344 4 7 12 2 48 





Tel [0.0852 0.0230 0.0060] 0.0853 0.0230 0.0060]0.0854 0.0230 0.0060 


Table 4.1. Error norm on locally refined grid. 


also independent of Pe. We may conclude the computing cost and accuracy 
are uniform in Pe. 


This is an important observation. A not uncommon misunderstanding is that 
numerical predictions of high Reynolds (or Péclet in the present case) number 
flows are inherently untrustworthy, because numerical discretization errors 
(‘numerical viscosity’) dominate the small viscous forces. This is not true, 
provided appropriate measures are taken, as was just shown. Local grid re- 
finement in boundary layers enables us to obtain accuracy independent of 
the Reynolds number. Because in practice Re (or its equivalent such as the 
Péclet number) is often very large (see Sect. 1.5) it is an important (but not 
impossible) challenge to realize this also in more difficult multi-dimensional 
situations. A thorough analysis of Pe-uniform accuracy for a two-dimensional 
singular perturbation problem will be given in Sect. 4.7. 


4.4 Schemes of positive type 


We now discuss measure (ii) of the end of Sect. 4.2. 
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A discrete maximum principle 


Lemma 4.4.1. Let cy and £n, n=1,2,...,N be reals satisfying 


N 
tn =0, tn <Oforn> 1 (4.25) 


n=l 


and 


Then 
In =z} n=l, N or 21<max{t,z: n>1}. 


Proof. From (4.25) it follows that cı > 0. Without loss of generality we may 
take c, = 1, so that 


N 
rm<- So entn . (4.26) 
n=ž2 


This shows that zı is not larger than a weighted average of zn, n > 1, 
so that the lemma is obvious, but the proof is completed anyway. Let 
Im = Max{zrťn : n = ?,3,..., N}, and let there exist some p > 1: £p < 2m. 
Then (4.26) gives 


N 
Ti < -Em X Cy = Tm 2 
n=2 


If there is no such p then we have x] = tm, M=1,..., N. o 


Let Lp be a linear discrete operator defined by 
Lavy = $ alj kpj FHM ed, 
kek 


where K is some index set, and a(j,k) are coefficients. For example, in the 
L, of equation (4.17) we have K = {~—1,0, 1}. 


Definition 4.4.1. The operator Lp is of positive type if 
X a(j,k) =0, f=2,..,5-1 (4.27) 
KEK 


and 


a(j,k) <0, k#0, 7=2,..,7-1. (4.28) 
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Theorem 4.4.1. Discrete maximum principle. 
If Ly is of positive type and 


Lip; <0, J=2,..,F-2 
then local mazima of y can occur only ifj = 1 orj = J, orp; = constant. 


Proof, For arbitrary j € {2,..., 7 — 1} let co = a(j7,0), ck = a(j,k), k #0. 
Then Lemma 4.4.1 applies after suitable re-indexing of c. Considering each 
interior grid point in turn, the theorem follows. o 


Corrollary Let (4.27), (4.28) also hołd for j = J. Then a local maximum of 
w can occur only for j = 1. 
Proof Trivial extension of the proof of Theorem 4.4.1. D 


If Larp; > 0 for j = 2,..., J— 1 and Ly, is of positive type then it can be shown 
in a similar way that local minima can occur only for j = 1 and j = J. Hence, 
if Lay; = 0 then local eztrema can only occur for 7 = 1 or j = J. We see 
that if the discretization is of positive type, then it obeys a similar maximum 
principle as the differential equation. Hence, spurious wiggles cannot occur. 
Therefore it seems attractive to work only with schemes of positive type. 
Unfortunately, these have the following drawback. 


Order barrier 


A prime concern is to have accuracy and computing cost uniform in Pe. 
Hence, we would like to have at least an accurate and positive scheme for the 
case Pe = oo. However, we have the following theorem. 


Theorem 4.4.2. Order barrier. 
Linear discretization schemes of positive type for the following differential 
equation: 


are at most first order accurate. 


Remark A linear discretization scheme is a scheme that gives a linear system 
of equations if the differential equation that is approximated is linear. By 
‘first order accurate’ we mean that 7; = O(A), A= max{Hj,..., Hs}, if the 
discretization is scaled such that the differential equation is approximated. 
The order barrier theorem holds on both cell-centered and vertex-centered 
grids. 
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Proof. We restrict ourselves to a uniform grid with cell-size h. Hence, cell- 
centered and vertex-centered discretizations are identical. In general, in grid 
points sufficiently far from the boundaries the discretization is given by 


Livy = X ak) dite - 


kek 


The local discretization error satisfies 


ry = Lnplej) -q= Y alk) {olas) + knee 


kek 
+ tkeh ed) + O(h3)} -qj . 


For 7; to be O(A) it is necessary that 


X a(j)=0, So ka(k) = 1/h. 
kEK 


kek 


Hence, a(k) = O(h7"). For the seheme to be second order accurate it is 
necessary that in addition 


X Kalk) = 0. (4.29) 


However, since La is of positive type, a(k) < 0 for k Æ 0, so that (4.29) 
cannot be satisfied. o 


A similar order barrier theorem for the time dependent case is discussed in 
Sect. 9.2. A way to get around the order barrier is to use nonlinear schemes. 
We will come back to this in Sect. 4.8. In practice, usually a better balance 
between accuracy and efficiency is obtained if r; = O(4°) instead of 7; = 
O(A) on smooth grids. 


Monotone schemes 

A scheme for which Lay > 0 implies p > 0 is called monotone. For a mono- 
tone scheme the solution decreases (or increases) with the right-hand side, 
cf. Exercise 4.4,1. Schemes of positive type are monotone (cf. Exercise 4.4.2), 
but the reverse does not hold. 


Mesh Péclet condition 


Consider equation (4.5), discretized according to (4.8) and (4.10). This results 
in 
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, 1 
a(j, —1) = 79 “j-1/2 _- w;—1/Hj~-1/2 ’ 
. 1 
a(j,1) = Yj+1/2 — wi/Hy 41/25 (4.30) 


a(j,0) = —a(j,-1) —a(j, 1). 


Do we have a scheme of positive type? Application of conditions (4.28) for 
all interior 7 = 2,..., J— 1 results in 


—2 < Uj41/2Hj41/2/ Wj < 2. (4.31) 
Define the mesh Péclet number in the present variable coefficient case as 

Pj+1/2 = [Uj 41/2| Hj 41/2/0j - (4.32) 
Equation (4.31) results in 


Pj41/2 < 2. (4.33) 


If D(z) is smooth one may just as well replace w; by Dj 41/2, in which case 
the customary definition of the mesh Péclet number is recovered from (4.32). 
Equation (4.33) is the generalization of (4.22) to variable coefficients and non- 
uniform grids. As is to be expected from the order barrier theorem, because 
(4.30) corresponds to a second order scheme, (4.33) is violated as D(x) | 0. 


Exercise 4.4.1. Let La be of positive type, and let L,y(%) = f( , a = 1,2. 
Show that if fP < fP , j=1,...,J then pP < gl, 


Exercise 4.4.2. Show that a scheme of positive type is monotone. 


4.5 Upwind discretization 


A scheme that gives us a scheme of positive type for all Pe is obtained if 
we approximate the convective flux uy such that, in the terminology of Def- 
inition 4.4.1, a positive contribution is made to a(j,0) and a non-positive 
contribution to a(j,k), k # 0. This may be achieved by using the upwind 
discretization (4.9). It is left to the reader to show that on a uniform grid 
the local truncation error satisfies (assuming the discretization is scaled such 
that the differential equation is approximated) 


Tj = O(A), (4.34) 


in accordance with the order barrier theorem 4.4.2. 
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Artificial viscosity 


Another, at first sight more crude, way to make La of positive type is to 
maintain central discretization but to artificially increase the diffusion coef- 
ficient D by an amount D, such that (4.33) is satisfied. In other words, we 
add an artificial viscosity term to the original differential equation (4.1), and 
solve instead 

dup d_,_ dọ d de 

— — —(D—)- —(Da—) = 

dg Jz | de? ds | ain) 1 
From (4.33) it follows that in order to obtain a scheme of positive type we 
must have (assuming D smooth, and putting w; = Dj+1/2) 


Da j41/2 > \Uj4is2|Aj4is2/2 — Dj41/2 - 


This means that Da j41/2 = O(Hj+41/2) suffices, so that the local truncation 
error of the artificial viscosity scheme will be O(H) on a uniform grid, as is 
to be expected from the order barrier Theorem 4.4.2. 


At first sight, modification of the differential equation to be solved seems a 
crude stratagem to obtain a scheme of positive type, and one might think 
that upwind discretization is more sophisticated. However, the two are closely 
related, because for the following choice of the artificial viscosity: 


Daj 41/2 = \4%541/2|Hj41/2/2 (4.35) 


the artificial viscosity scheme is identical to the upwind scheme. Turning the 
argument around we see, that compared to the central scheme the upwind 
scheme introduces an additional local truncation error representing an in- 
crease of the effective viscosity by an amount given by (4.35). This is why it 
is sometimes thought that high Reynolds number (or rather, in the present 
situation, high Péclet number, i.e. D < 1) flows cannot be computed with 
linear schemes of positive type, because the numerical Péclet number is gov- 
erned by D+ Da, and is therefore O(HS A ja)" However, it has already been 
seen in Sect. 4.3 that to achieve accury and computing cost uniform in Pe 
it is necessary to apply local grid refinement in the boundary layer, where 
H; = O(Pe™'), bringing the numerical Péclet number close to the physical 
Péclet number, and making upwind discretization unnecessary even. Outside 
the boundary layer we are still stuck with a numerical! Péclet number which 
is too low. But in the numerical examples of Sect. 4.3, ¢ = constant out- 
side the boundary layer, so that the artificial viscosity term is negligible, and 
uniform (in Pe) accuracy and computing cost seem to come into reach with 
linear schemes of positive type. The situation that p = constant outside the 
boundary layer is not so atypical as one might think, for in real high Reynolds 
number flows velocity gradients are often small outside boundary layers, so 
that the artificial viscosity terms are small. We will shortly further explore 
these issues numerically. 
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Hybrid scheme 


Because of the loss of accuracy incurred it is a good idea to use the upwind 
scheme only in regions where (4.33) is violated, and use the central scheme 
elsewhere. This can be done as follows. In the finite volume discretization of 
the convection term (uy)j41/2 has to be approximated. Central discretization 
according to (4.8) is defined by 


Pe — 1 
(up)j41/2 = (up)e 541/2 = 5Uj4i/2(%7 + 541) ; 
2 


and upwind discretization according to (4.9) is defined by 


~ _1 
(uy) s4ij2 = (up)uj+i/2 = 5 (uiti2 + |ulj41s2)¥; 
1 
+ 5 (Ui+1/2 — (ujal) Pj - 


Let us define a switch function s(p;}1/2) with the mesh Péclet number for a 
non-uniform mesh p;41/2 defined by (4.32), and define hybrid discretization 
by 


(ugp)j+1/2 = $(Pj4ij2)(up)uj+iy2 + (1 — s(Pj+1/2))(up)cj+1/2; (4-36) 
which gives 
(up) j4i/2 = (Uja + 8(P341/2)Ujsi2l) ys 
+ 5 (ujsis2 7 $(Pj41/2) Uj41/2l) e541 - 


For iterative convergence in nonlinear cases it is beneficial if s(p) switches 
smoothly between 0 and 1, for example 


s(p) = 0, p< 19, 
s(p) = 10(p—1.9), 19<p<2, (4.37) 
s(p) = 1, p>2. 


The hybrid scheme has been introduced in Spalding (1972), and is further 
discussed in Patankar (1980). 


Exercise 4.5.1. Prove (4.34), assuming constant coefficients u(x) and k(z). 


Exercise 4.5.2. Consider the following problem: 


du _,d 
—~ — Pe 1 = 42) 0<z<1, 9(0)=0, p(1)=1 (4.38) 
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with 
u(x) = 1—bsinez, (4.39) 
and Pe constant. Choose the following exact solution: 
e(z-1)Pe _ ẹ-Pe 
p(z) = a(sinamz — sinar) + , (4.40) 
1 — e-Pe 


and derive the corresponding right-hand side q(z). 


Note that with a = b = 0 we have q(x) = 0, and we recover the test case of 
Sections 4.2 and 4.3. Let the grid be non-uniform, and choose it as in Sect. 4.3, 
or uniform. Implement the hybrid scheme, and include also the upwind and 
central schemes by freezing s(p) at 0 or 1. Implement defect correction (see 
Sect. 4.6). 


Define and perform numerical tests that illustrate, confirm or refute the 
points made in Sections 4.2-4.5 about occurrence of spurious wiggles and 
accuracy. Plot exact and numerical solutions as in Sections 4.2 and 4.3. Com- 
pute the maximum norm of the error, and find out how it depends on the 
number of cells J. 


(a) Solve the same problem as in Fig. 4.2, but with the upwind scheme. 
Discuss the results. 

(b) Solve the same problem as in Fig. 4.5, but with the upwind scheme. 
Discuss the results. 

(c) Solve the same problem as in Fig. 4.5, but with the hybrid scheme. Discuss 
the results. 

(d) Choose a = 1, b = 0.95, a = 3. Use the grid of Fig. 4.5. Compare the 
upwind, hybrid and central schemes. 


4.6 Defect correction 


The algorithm 


Defect correction is a method to improve the accuracy of a lower order dis- 
cretization, without having to solve for a higher order discretization. Denote 
the system of equations corresponding to a lower and a higher order dis- 
cretization by, respectively, 


Lngn = fh, Lava =gh . 
Defect correction is an iterative method defined by 


Eroi = Gh , (n—1) í 1) 


Lig’ = qh — Lag, + Eng; », n=l,2,.... 
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If La is a first order scheme and L» is a second order scheme, then already 
(1) 


p, is of second order accuracy. This we will now show. 


Accuracy of defect correction 


Denote the boundary value problem to be solved by 
Lp=q, PEŠ, qeQ, (4.41) 
with @ and Q suitable function spaces. Let 
yi”) E, n=0,1,..., 
and 
ghE Qn, GhE Qn. 
Define the following operators: 
Rnz:Q7Qn, Ra: Q>Qn, Ra: O44, 
such that 
qa = Raq, Gn = Rag 
and, on a two-dimensional grid for example, 
(Ray)ix = p(w) - 


The operators R, and Rp may be different, depending on how the boundary 
conditions are approximated. Here we need not be specific about the choice 
of these operators. The local truncation error (defined by (3.48)) of scheme 
L, can be written as 


Th = Lar Bro — qah = (Li Rp — RiLl)y . 
Assuming Ly, to be first order, we have 
Ln Rn - RaLllo,ce < C14, (4.42) 


with A the maximum mesh size. Here operator norms are defined in the usual 
way, for example 


Zalone, =sup{||Lavallo,/llenlle, : 0 E on EPa}. (4.43) 


We will come back to the choice of norms later. Assuming La to be second 
order accurate, we have 


4.6. Defect correction 131 


Zn Ba — RaLilo,ce < C24? . (4.44) 


The two schemes are assumed to be relatively consistent, in the following 
sense: 
|La — Lalloneen < C34. (4.45) 
L, is assumed to be regular: 
Li lorg < Ca- (4.46) 


The constants C1, C2, ... are assumed to be independent of A. Equations 
(4.42)-(4.46) are our set of assumptions. 


We will show, following the general framework presented in Hackbusch (1985) 
Sect. 14.2.2, that 


le — vile, = O(h?) . 
where p} = Ray. We have 
Lle — ph) = (Ral - Lr Rajo- 


From (4.42) and (4.46) it follows that 


lew - ville, < CrCsAllelle - (4.47) 
Furthermore, 
Lalo — 9h) = qh + (Ln — La)(y — ph) — Lavi 


(Ral — EnRn)y + (Li — Lae ~ Pp) 5 


so that, using (4.44), (4.45), (4.46) and (4.47) 


eo? -= ville, < A?(C2 + CiCsC4)|lelle » 
which is what we wanted to show. 


Whether the assumptions (4.42)—(4.46) are satisfied depends on the choice of 
the norms. In the Taylor expansions for the local truncation errors of La and 
L, derivatives of y occur, so for (4.42) and (4.44) it is necessary that ||p||s 
measures y and its derivatives up to the order of those that occur in the local 
truncation error Taylor expansions. To elucidate (4.45), suppose that (4.41) 
is the convection-diffusion equation, and that L» is an upwind version of La 
obtained by addition of artificial viscosity with coefficient D, = O(h), as in 
the preceding section, so that on a uniform grid 


En = Ly + Da An , 
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with An the discretization of the Laplacian. Choosing 


lvalle, = Arpal » 


equation (4.45) is trivially satisfied with C3 = Da/h. It remains to study 
(4.46). We have 


Li anllan = lAa Ez anllon - 
Assume that L, is scaled such that 
An = A/h?, Ln = (D+ Da)A/k? + B/h, 
with the elements of the matrices A and B independent of h. Then 
A,L,* = A{(D + Da)A + hB}! = (D + Da Y} (I + O(h)) , 


where the O(h) term represents the norm of a matrix of rank O(h~!) and 
elements of size O(h). To make the remainder of our line of argument rigorous 
the size of this O(h) term would have to be investigated further, which we 
will not do. Proceeding, we get 


[An Lk "anll = (D + Da) *Ilaalla, (1 + O(h), 
so that (4.46) holds with 
Cy = (D + Da)“ + (1 + O(h)). 


Numerical illustration 


We solve the boundary value problem (4.38)-(4.40). We choose a = 0.2, 
b = —0.95, a = 1. The grid is cell-centered, similar to the one shown in 
Fig. 4.4. The thickness of the refinement region is 8 = 6/Pe. The number 
of cells in the refinement region is my and in the remainder of the grid it 
is Mme. Table 4.2 presents results. One defect correction has been applied for 
method 3. In many cases the central scheme shows small wiggles, see for ex- 
ample Fig. 4.6. Figure 4.7 shows results for upwind discretization and defect 
correction. In all cases, the defect correction solution is visually wiggle-free 
and has error almost as small as with the central scheme. The accuracy of the 
upwind scheme is almost independent of Pe; this will be shown theoretically 
in a situation with a parabolic boundary layer in Sect. 4.7. For this the maxi- 
mum principle is required. The central scheme does not satisfy the maximum 
principle, and its wiggles get worse when Pe increases, which explains why 
the central scheme is not second order for Pe = 400. 


Exercise 4.6.1. Repeat these numerical experiments with the software you 
developed for Exercise 4.5.2. Compute also results for the hybrid scheme. 
Does the hybrid scheme produce better results than the upwind scheme? 
Apply defect correction to the hybrid scheme. 
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Cm my [method | jele | method | [ef [method | Jef 
e = 40 


ee a 
0.1087 2 0.1033 0.1037 

0.0412 2 0.0284 0.0284 

0.928 2 0.0074 0.0074 

‘act a) 0.1068 3 0.1063 

is D i 0.0471 2 0.0289 3 0.0288 


Table 4.2. Maximum norm of the error for problem (4.38)~(4.40). Method 1: up- 
wind discretization; method 2: central discretization; method 3: defect correction. 
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Fig. 4.6. Solution for Pe = 40, me = 8, my = 6. — : exact solution; * : numerical 


solution with central discretization. 


4.7 Péclet-independent accuracy in two dimensions 
Problem statement 


In this section we study the following version of the two-dimensional station- 
ary convection-diffusion equation: 


Ly = (uaF), a — (Df a)a =0, @=1,2, (21,22) € (0,1) x (0,2). 
(4.48) 


We assume that we have solid walls at z3 = 0,2 so that u2(#1, 0) = u2(21, 2) 
= 0. Let u < 0, so that zı = 0 is an outflow boundary. In view of what 
we learned in Sect. 2.5 we choose a homogeneous Neumann boundary condi- 
tion at zı = 0, and Dirichlet boundary conditions at the other parts of the 
boundary: 
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Per40 upwind scheme 14 ceils 1 deicorr 


a 
is] 
rte 


0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 


Fig. 4.7. Solution for Pe = 40, me = 8,my = 6. — : exact solution; * : upwind 
discretization; + : defect correction. 


y,1(0, £2) = 0, y(r1,0) 


(zı 
y(@1,2) = g°(z1), (1, 22) 29 


= g'(21), 
= g?(z2). (4.49) 


We will not discuss the three-dimensional case, because this does not provide 
new insights. 


Our purpose in this section is to show, as in Sect. 4.3, but this time more 
rigorously and in two dimensions, that (4.48) can be solved numerically such 
that accuracy and computing cost are uniform in Pe. Therefore fear that it 
is impossible to compute high Péclet (Reynolds) number flow accurately is 
unfounded, as argued before. We now make our case even more convincing. 
Our treatment will be detailed and technical, but elementary. 


In order to illuminate the main point with as little technical detail as possible, 
we specialize to u = —1, uz = 0 and D constant. Furthermore, the problem 
is assumed to be symmetric around z? = 1, so that the solution needs to be 
computed only in half the domain, namely (0,1) x (0,1). Equation (4.48) is 
rewritten as 


Lp =-P1- EP aa = 0, 
with 
e=1/Pe= D/u 


where Pe is the Péclet number. The purpose is to find a discretization with 
accuracy and number of grid cells uniform in £ as € | 0. 
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Choice of grid 


In view of Sect. 2.5 we expect a parabolic (because u? = 0) boundary layer at 
£ = 0 with thickness O(e!/*), Because a homogeneous Neumann condition 
is applied at the outflow boundary, there is no boundary layer at zı = 0. 
In order to make accuracy and computing work uniform in e we choose a 
grid with local refinement in the boundary layer, as sketched in Fig. 4.8. The 
region of refinement has thickness ø. The choice of o will be discussed later, 
and is such that the boundary layer falls inside the refinement region. The 
refined part of the grid is called Gz, the interface between the refined and 





Fig. 4.8. Computational grid 


unrefined parts is called [ and the remainder of the grid is called Ge. The 
mesh sizes in Gy and Ge are uniform, as indicated in Fig. 4.8. Note that the 
location of the horizontal grid lines depends on €. 


Finite volume discretization 


The cell centers are labeled by integer two-tuples (i, j) in the usual way: 92;; 
is the cell with center at (z;, y;). Hence, for example, (i+ 1/2, j) refers to the 
center of a vertical cell edge. Cell-centered discretization is used as described 
in Sect. 3.5 (two-dimensional uniform grid) and Sect. 3.7 (one-dimensional 
nonuniform grid). For ease of reference the discretization is summarized be- 
low. The finite volume method gives 

[cea = Pi PSN 

fy; 


with the fluxes F! and F? given by 
Fl= —p-—EY1, Fe = —Ep~2. 
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Using upwind discretization for the first derivative, the fluxes are approxi- 
mated as follows: 


Flag S -Kifer + elpis = pij) /H} 
Ffig = —2Hie(pijy — Pi) /(K; + Kjt) , 


where K; is the vertical dimension of 92;; : K; = hg in Gy and K; = H3 
in Ge. The cells are numbered i = 1,..., Z and j = 1,...,J in the zı- and 
zə-directions, respectively. 


In order to facilitate the error analysis that is to follow we place additional 
unknowns 9; 1/2, $741/2,; on the Dirichlet. boundaries and extend the system 
of equations with the boundary conditions: 


Pi1/2 = g; » PI+1/2, = g; . (4.50) 
The boundary conditions are implemented as in Sect. 3.5. This gives 
Pias = Kjfij, 


Froiyag = Kiler; — 2elprzi — 915)/Fi} , 
Pa —2Hie(pi 1 — Pi,1/2)/h2 ; 
2 

Fi y41/2 0. 


The discrete system, including (4.50), is denoted by 
Lep =q. (4.51) 


Local truncation error 


The local discretization error is easily found from (3.50), (3.67) and (3.69). 
However, the (A?) terms in these equations depend on Pe = 1/e. Because 
we want to study the accuracy as € | 0, the dependence of the O(A?) terms 
on £ has to be exhibited. 


Let us write the local truncation error as rT = 71+ 77, where r® is associated 
with the z,-derivatives. By Taylor expansion one finds, to leading order in 
the mesh sizes, writing x = 21, y = zz in the remainder of this section, 


Intl < HKO, j=l. J, (4.52) 
Ih] < H{Kj;C,, 1=2,..,7-1, j=l. J, (4.53) 
Ir] < MKC, j=l. J, (4.54) 
IAI < HiheCy, i=1,...J, (4.55) 
IZ | < MiAgCs, i=1,..4,/, j=2,..,3r—1, (4.56) 
In? | < MiH}Ce, i=l. I, Ga=jrt2e,J-1, (4.57) 
I2 < H, H?C3, i=1,..,/, (4.58) 
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with jr such that the cells §2;;, are just below the interface I’ (cf. Fig. 4.8), 
and, replacing zı by z and zz by y, 


Cı 


mlg al (2, y) € Q) + sup{| ČE: (2, y) € 2}, (4.59) 


Ta 


Cz = wlz z F|: (z, y) € 2} + sup{| (z,y) E 2}, (4.60) 


rg 


Cy = supli 22): (2,y) € 2) +l: (ey 2}, (461) 
Os = supll Fe yen). (4.62) 
Os = sur (l SFE: (e) ER), (4.63) 
Cs = voli EM}. (4.64 
Cr = sadla l: 0<z<1,1-H:<y<1}. (4.65) 


Here N; is the part of the domain 2 that is covered by the refined grid Gy, 
and Ne = §2 \ §2;. 


For the cells adjacent to I’ we need expansions similar to (3.50). We find, for 
i=l, d, 


Tip = -žep 2Hi (H — h2) — Zepi (He + 3h2)(H2 — h2) 
-735le}) 222 {h + Elha + Hay}, (4.66) 
Thirtl = Fep M(H - hg) - geen (hi + 3H2)(H2 — he) 
-Z eple}) 22H {H3 + (h + H,)?}. (4.67) 


Where no argument is given the derivatives of the exact solution (æ) are 
evaluated in the same point as 7%“ ; a} stands for various points in the 
vicinity of F. More precisely, we have 


epEX, 


with X the union of the two rows of cells adjacent to J’. On the Dirichlet 
boundaries, because (4.50) is exact, 


Ti 1/2 = T141/2,j =9. 


Because of finite volume integration, a factor H, A; has been gained. Hence, 
a second order scheme has r = O(A‘), A = max{H,, K;}. We see that 
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(due to upwind discretization) the scheme is first order in the interior and at 
Neumann boundaries, but inconsistent in cells adjacent to F and to Dirichlet 
boundaries, in the maximum norm. This at first sight unfavorable property of 
cell-centered discretization on nonuniform grids is why we have chosen cell- 
centered discretization; vertex-centered discretization turns out to be consis- 
tent. It will be seen that nevertheless the global accuracy is O(H, + H2). 


The discrete maximum principle 


The concept of operator of positive type of Sect. 4.4 is easily generalized to 
two dimensions. Define temporarily j = (j1, j2) , k = (kı, k2) with ja and ka 
integers. Then the discrete system (4.51) can be written as 


Lepi = ` a(j, k)Pi+k = qj; 
KEK 


with K = {(0,0), (0,41), (£1, 0)}. Define 
G= {1,...,J}x« {1,...,J7}. 
We say that Le is an operator of positive type if 


Š a(j,k) =0, a(j,k) <0 for k#(0,0) WIEG. (4.68) 
keK 


It is easily seen that condition (4.68) is satisfied. Hence (cf. Sect. 4.4) if 
Ley; < 0 » JE G ’ 


then y cannot have a local maximum for j € G. Therefore the maximum of 
y is to be found on the Dirichlet boundary Gp, with 


Gp = {1,.. I} x {1/2} U {7 4+ 1/2} x {1,..., J}. 


Error estimation with the maximum principle 


By » < Ņ we mean 


and by |p| we mean the grid function with value |y;| in æ;. The global 
discretization error satisfies 


Lee=r. 


Suppose we have a grid function E, which will be called a barrier function, 
such that 
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L-E > |r|. (4.69) 
Then 
L.f{e—-E)<0. 
According to the maximum principle, 
e— E<max{-E;: j€ Gp}, 
where we have used e; = 0 for j € Gp. Furthermore, 
L.(-e- E) <0, 
so that 
—e — E < max{- E; : jE Gp}. 
If follows that 
le| < |E] + max{|E;|: j € Gp}, (4.70) 


which is the desired estimate. 


Expansion for global truncation error 


From now on, i, j and k are again integers. Similar to (3.63) we write for the 
global truncation error e;; = p(æij) — pij: 


4 
e= 5 e* (4.71) 
k=1 


We choose e!, e? and e? such that L,(e' + e? + e?) equals the first two 
terms in (4.66) and (4.67) and is of higher order for j Æ jr,jr +1; e* will 
be estimated with the maximum principle. Let 


ef = Ve(@is)ue, k=1,2,3, 
with 
wy = 0, j<Sjr, wp=HZ-hy, j>2jr+l, 


a l 
H =O, j<jr, Hiri = gg H+ 3h)(H3 — h), 


4 
1 . . 

u = Wit gin(Hy—-h}), j>ir+2, 
w= 0, ILIJE, 

1 o 
Hj = ~3(Hz +h) (H3 - H3), j>jr+l, 

l 

Yi = 3222, Y2= P222, Y3 = Y1,2. 


8 
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We have 
Le{pjp(wiz)} = pj Lev(wi;) (4.72) 
—ep(æi;){uj+1 > Hy) /Ky4ay2 — (Hi — Hj-1)/K;-1/2} 
1 
—€0,2(@ ij) (i41 — Hj—1) + O(SHjeHa,22(@")) , 


where æ* is some point in the cells (7, j) or (i, j +1). Lep can be estimated 
by (4.52)-(4.65), replacing p by ẹ%. After some cumbersome manipulations 
we find, to leading order in the mesh sizes: 


Laet = ri47?, 
~2 2 e ; ; 
Tij = Tijs t=1,../, j<jr-1, 


ee < A HyCs , i= lend, ir <j<d- 1, 
1 
Cs = supt| Se, 2222] LEXUR}, 
[al < 9HH?C7, i=1,..,1. 


Note that e!, e?, e? are O(H2). It remains to estimate e*. In fact, we have 
taken out the O(1) local errors near the boundary I of the refinement zone, 
and shown that these give only a second order contribution to the global 
error. 


Error estimate 


We will estimate e* with the maximum principle. Without any risk of con- 
fusion we will not distinguish between grid functions and functions of x. A 
barrier function is chosen as follows, in a dimension-by-dimension approach 
that is extendable to more dimensions: 


E(x) = B1 E (x) + B2E°(y) , 


with 


li 


(e) = 1+(1-2)(2+2+ H), 0<2<1-H/4, 
(z) = 0, z>1-HĦ/4, 
E?’(y) = 0, O0<y<ho/4, 
(y) = 1+y(20 + 30° — y)/a?”, h/d<y<ao, 
(u) = E*(o)+(y-e)(3-y), o<y<1, 


Ii 


where o = (jr — 5)he. We find, for j = 1,..., J, 
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L-E} > KjH,, i=2,...,T-1, 
LEi; > Kj. 


Furthermore, for i= 1,..., J, 


LE}, > 2Hi/h2, 
LE, > 2Hyhz/o?, j =2,..,jr—1, 
L-E} > 5eHi Hs j=dr,.,J-1, 
LE} > eH, jad. 
With 
Bı = Hı max{Ci , C2, C3} 
we find 


L:-B E' > |r'|. 
Furthermore, let 


Bz = €~' max{h2C4/2 , h2Cs07/2, 9H2C7 , 2H2Cg/3} . 


Then 
L-B,E® > |r°|. 

Therefore, E satisfies (4.69). Because 0 < Et < 4 and 0 < E? < 5 we have 
|E| < 4B) + 5B, . 


Hence equation (4.70) results in the following estimate: 
le*| < 8B, + 10B. 


The other terms in the error expansion (4.71) are easily estimated. We have, 
to leading order in the mesh sizes, 


1 
< =H} sup{ly.22|: æ € 2}, 


le*| 
— 8 
1 
le*| < Ha sup{|y,222]: Œ E€ Re}, (4.73) 
1 
le] < SH3sup{|p,222|; @ € 2}. 


16 
Note that for ¢ fixed 
e=O(H,+H2), 


so that the fact that the scheme is locally inconsistent in the maximum norm 
near I’ and the Dirichlet boundaries does not affect the global truncation 
error, as was made plausible in Sect. 3.7 for the one-dimensional case. 
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Dependence of the error on £ 


We now turn to the main purpose of this section, which is to show how the 
accuracy and the computing work can be made independent of £ as e | 0. For 
the analysis of the dependence of the bound obtained on € we need to estimate 
derivatives of p. For this the first term of a matched asymptotic expansion 
for the solution will be used. In order to show rigorously that higher order 
terms in this expansion may be neglected a laborious analysis of higher order 
terms would be needed, from which we will refrain. Numerical experiments 
will give further validation of the results obtained. 


For the terminology and the method followed, see Sect. 2.5 and the literature 
on singular perturbation theory, for example Eckhaus (1973), Kevorkian and 
Cole (1981) Van Dyke (1975). The outer equation is 


—ðpo/ ðr =9, poll, y) =93(y) . 


Its solution, called the outer solution, is given by 


po = 9° (y) . 


The inner or boundary layer solution y, that approximates y — wo is the 
solution of the following boundary layer problem: 


dyn P 


Dr cay = O<ze<1, 0<y¥<w, 


(4.74) 
poll, y) = 0, polz, o0) = 0, polz, 0) = glz) ’ 
with 
g(z) = gi(z) ~ g3(0) . 


In order to avoid a corner singularity it is assumed that gi(1) = ga(0). The 
solution of (4.74) is given by 


_ 2 T —1;2 y? 
Yo = V2 f e gl + say )dt. 
y/f2e(1-2) 


A uniformly valid asymptotic approximation to y is given by yy + yo. We 
find, using g{1) = 0: 


2 LA j , 2 
ovo __} — J e738 g (x Y dt. 


y/rf2e(1-z} 
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Assuming |g (a)| < Mı, 0 < z < 1 we obtain 


2 wis <- = Mierfe( y/2Vé) . (4.75) 





Using the inequality 


2h e77" <erfe(z — PINT e7 z 
z4 V242 < felz) < z7 z? 44/7 (29) (4-76) 


we may without losing much sharpness replace (4.75) by 


fae 


4 l —y?/4e 
D S Je ee [ae (4.77) 


= f TE y+ y? + l6e/r 


It also necessary to estimate 04, /0y*. In order to avoid a corner singularity 
at (1,0) it is assumed that 





Then 





Assuming lg (x)| < M2, 0 < z < 1 we obtain 





ot l 
[SI < z Mrerfe(y/2V), 


Again using (4.76) one obtains 


at 4 1 
Oe) A yg, Ee, 
dy Vrei yt Jy? + 16e/n 


Finally, 0°y;/dy? has to be estimated. We obtain 


A wp _ yz f g" y? 


y/4/2e(1-2) 


= -+2 j: +e — ie"? dt. 


y/r>f/2e(1—-z) 





Hence 
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By, 1 2 F “ap 
| ays |< Lm? J t? — lje at dt . (4.78) 


yf J2e 





Let us consider the estimate for e'!) given in (4.73). Using p = vp + po we 
obtain, using (4.77), 
ap 2 1 ~o7 /4e 

sup (58 : (2, y) E Qe} < M3 + ae Mi we I4e 


7 





with 
Mz = sup{|g3(y)|:0<y < 1}. 
For el?) and el) we need, using (4.78) and assuming o/V/2e > 1, 


1 2 T 
Ma + ~M] = J (4? — We 3" dt 
EC T 


af/2e 

l 2 
Ma + ¢73/2?M,—=e~? /4e 

vr 


: (£, Y) € Re} 


lA 





Py 
sup{ | ay 


where 
M4 = sup{|g* (y)|:0<y< 1}. 


For Bz we have to estimate C4, Cs, C7 and Cg. For this is needed: 


: (x,y) E 25} < Mi/e + M3, 


ay 
sup{ Oy? 








: (@,y) E Rs} < Mo/e’ + Ms , 





dtp 

sup{ yi 

4 
sup{ 





el, -3/2 2 —6* j4e 
Fyi : (x,y) EXUN:} <M; +e [° Mz-7zðe / ) 





:0<2z<1,1-H<y<1}Ś 





ð? 
solla 


Ma + Lm? J (t? — 1)e73* dt = Ma + cM ee , 
1/J/2e 
where 
=ø- hz, Ms =sup{ig?” (y)|:0<y< 1}, 
and where we have assumed 


j/V >l. 


Finally, the z-derivatives are O(1). 
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Thickness of the refinement region 


We now have to choose o and fz such that the error e! + e? + e? + et is 
uniformly bounded in £. Choosing 


o = V—6elne (4.79) 
we see that 
e1 + e2 + e3 = O(H?) 


uniformly in ¢. Furthermore, to have Bz uniformly bounded it is necessary 
that 


he = Hz y —ce/ Ine ; (4.80) 


with c some positive constant. Then we have 


~h3C4/2 < “cH (M; + ¢M3)/Ine', (4.81) 

1 

5 hase” < = cH}(Mz + €?M) , (4.82) 
2c, < 2 H?(M +732 M l e-1/45) (4.83) 
E 207 ™ 8 2 4 t T ' ` 

2 2 1 

zz H3Cs < gHa( e Y- ne Ms + Ms). (4.84) 


In (4.83), the difference between o and & has been neglected, which is asymp- 
totically correct. Uniform boundedness of Bz follows from (4.81)-(4.84). The 
following error estimate results in the maximum norm: 


lell = O( Ai + H3) (4.85) 
uniformly in €. 


Equations (4.79), (4.80) and (4.85) are our main results. Although the goal of 
uniform accuracy has been achieved, the computing work increases (slowly) 
as £ | 0. If there are ne horizontal grid-lines in Ge, then the number of 
horizontal grid lines ny in Gy satisfies 


ny = O(nelne™t), (4.86) 


which follows from (4.79) and (4.80). However, numerical experience shows 
that the logarithmic factors in (4.79) and (4.80) come into play only for 
unrealistically small values of H2, so that in practice one may work with 


o= Ove), hy = O( EM?) - (4.87) 
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Now the computing cost is uniform in £, whereas numerical experiments still 
show uniform accuracy. 


The analysis of vertex-centered discretization, where the nodes are the ver- 
tices of the cells in Fig. 4.8 rather than the centers, is easier, because an error 
expansion like (4.71) is not required. By using the maximum principle in a 
similar way, (4.79), (4.80) and (4.85) may be derived. 


It is also possiblle to allow in addition to the parabolic boundary layer at 
y = 0 an ordinary boundary layer at r = 0. An additional refinement re- 
gion of thickness O(c Ine~!) needs to be introduced at z = 0 with mesh size 
hy = O(Hyelne~*). The same method of analysis can be followed. 


In Farrell, Hemker, and Shishkin (1996), Hemker and Shishkin (1994), 
Shishkin (1990) it is proposed to choose 


ð = Ov/elnny > AJEN, (4.88) 


so that the computing cost is uniform in £e. The corresponding grid is called 
a Shishkin grid. By a less elementary proof than above it is shown that the 
accuracy is uniform in £, but a result less sharp than (4.85) is obtained. 
Numerical evidence points in the direction of (4.85). 


Because the analysis uses the maximum principle, upwind discretization is 
necessary. But in practice, as will be shown, central discretization also gives 
good results, with O(H? + H2) accuracy. 


Numerical experiments 


The following exact solution as assumed: 


y 


__ 1 7 (2 — y)? 
2 = Jr fox nt XP 40 wh 


The right-hand side and boundary conditions in (4.48) and (4.49) are chosen 
accordingly. Because the solution is extremely smooth in 2, it turns out that 
in Ge the number of cells in the vertical direction can be fixed at 4; the 
maximum of the error is found to always occur in Gj. We take 


og = 8Ve, 
neglecting the logarithmic factor in (4.79). 


Table 4.3 gives results for the cell-centered upwind case. Equation (4.85) is 
confirmed. Exactly the same results are obtained for € = 1075 and £ = 1077, 
showing ¢-uniform accuracy, despite the fact that the logarithmic factors in 
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(4.79) and (4.80) have been neglected, to make work and storage indepen- 
dent of £. The maximum error occurs in the interior of the boundary layer. 
Table 4.4 gives results for central discretization. Visual inspection of the re- 


8 | 32 54 
32 | 64 14 
128 | 128 3.6 


Table 4.3. Maximum error as function of number of grid-cells for € = 107*; cell- 
centered upwind discretization. nx: horizontal number of cells; ny : vertical number 
of cells in Gg. 






16 92 
32 


64 
128 





Table 4.4. Cell-centered central discretization; € = 107°. 


sults shows no visible wiggles. The rate of convergence is somewhat worse 
than the hoped for O(H? + H2), but here again the same results are ob- 
tained for e = 1077, showing uniformity in €. 


Table 4.5 gives results for the vertex-centered upwind case. A rate of conver- 


8 | 32 69 
32 | 64 15 
128 | 128 3.8 


Table 4.5. Vertex-centered upwind discretization; € = 107°. 






gence of O(H, +H?) is clearly demonstrated. The accuracy is about the same 
as for cell-centered discretization. Exactly the same results are obtained for 
e = 107° and 1077, again showing ¢-uniform accuracy. 


Finally, table 4.6 gives results for vertex-centered central discretization. The 
rate of convergence is O(H? + H2). Further refinement shows that the finest 
grid of table 4.6 is the finest that rounding errors allow. No wiggles are ob- 
served. Results for € = 107° and £ = 1077 are virtually identical. 
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8 | 16 30 
16 | 32 6.76 
32 | 64 1.88 


Table 4.6. Vertex-centered central discretization; € = 107°. 






Further numerical experiments, including grids choosen according to (4.86) 
and (4.88), may be found in Wesseling (1996b). 


We may conclude that in practice work and accuracy can be made to be 
uniform in £, both for cell- and vertex-centered discretization, by suitable 
local mesh refinement according to (4.87) or (4.88). Hence, in principle, high 
Reynolds number flows are amenable to computation. 


4.8 More accurate discretization of the convection term 
Introduction 


In many applications in fluid dynamics the Reynolds number satisfies Re > 1, 
and as a consequence the diffusion terms are small in large parts of the 
domain. It is therefore especially important to approximate the convection 
term accurately, the more so because in inviscid flow models, such as the Euler 
equations, diffusion terms are not present. When improving the accuracy of 
the discretization of the convection terms, the order barrier on schemes of 
positive type (Theorem 4.4.2) makes itself felt. In order to obtain higher order 
accuracy without spurious oscillations , a way around this barrier has to be 
found. This leads to the introduction of flur limiting. 


The «-scheme 


Let us reconsider equation (4.5), discretized with the finite volume method 
on the grid of Fig. 4.1 according to (4.6). Let us split the flux function F in 
its convective and diffusion parts: 


F= F°+ F", Fe=uy, F” =—Ddp/dz. 


As compared to the second order central scheme (4.8) or the first order up- 
wind scheme (4.9), with the x-scheme one tries to enhance the accuracy by 
using an additional cell-center value, namely pj-1 for uj41/2 > 0 or pj42 
for uj;41/2 < 0. In order to maintain second order accuracy, to the convective 
flux function of the central second order scheme a higher order term is added, 
as follows: 
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1 K 
Firipa = Ujtip2t5 (ei + Pita) + le-a + 2675 - Git} Uj41/2 20 


c 7 1 l-k 
Fiya = tjat gles + esta) + (es + 26541 — Git2)} Ujta <0 
(4.89) 


The approximation to the convection term becomes 


dup 

dz j 
This one-parameter family of schemes, introduced by van Leer (1977}, will 
be called the «-scheme. If u = constant > 0 the stencil is given by 


IR 


l č c 
h; Fite T Fir) - (4.90) 


ae 38k -—5 3-3k lek 0 


For « = —1 we obtain the one-sided second order upwind scheme (Warm- 
ing and Beam (1976)). With « = 0 one obtains Fromm’s zero average phase 
error scheme (Fromm (1968)), if in this scheme the terms quadratic in the 
Courant number are neglected, which are meant to improve time accuracy, of 
no concern here; this scheme results from optimizing, among 5-point schemes, 
for the propagation of a step function over one time step in the absence of 
diffusion (Wesseling (1973)). For « = 1/3 the third order upwind biased 
scheme of Anderson, Thomas, and van Leer (1986) results. For x = 1/2 we 
have the QUICK (quadratic upstream interpolation for convective kinemat- 
ics) scheme proposed by Leonard (1979). This corresponds to approximation 
of 41/2 by interpolation of highest possible order (i.e. quadratic) between 
three surrounding grid points, but does not lead to a local truncation error of 
maximum possible order (cf. Exercise 4.8.1). Finally, x = 1 gives the central 
scheme (4.32). 


The «-scheme provides a convenient framework for formulating upwind bi- 
ased schemes. For x = 1/3, the «-scheme is third order accurate; for all other 
values of «, it is second order. 


A remark on the local truncation error 


The global and local truncation errors in grid point j have been defined in 
the preceding chapter as 


ej =p(zj)— Pj, Tj = Lae; , 


with v(x) the exact solution and La the discrete operator. It is assumed that 
La is scaled such that it approximates the differential operator. Let D = 0, 
and assume 7; = O(h?) and e; = O(h4). Since (4.90) is an exact consequence 
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of the differential equation and contains no error, it may come as a surprise 
that q is not maximized by making Peeaye as accurate as possible, which 
is the case for « = 1/2, resulting in p = q = 2. Instead, q is maximized 
for x = 1/3, resulting in p = q = 3. The underlying cause of this apparent 
discrepancy is, that (4.90) is not equivalent to the differential equation. It 
follows from the differential equation, but the differential equation does not 
follow from (4.90). The situation is illustrated in Exercise 4.8.1. 


The fourth order central scheme 


Again using a 5-point stencil, but no upwind bias, 7; = O(h*) on a uniform 
grid is obtained by 


e ~ 7 1 
Presse = tjit yg (es + Pitt) — Tali- + Pit) 


If u = constant the stencil for the resulting approximation of (dug/dx), is 
given by 


u 


al -808 -1]. (4.91) 


Approximation of y;41/2 by interpolation to highest order gives a different 
stencil, with 7; = O(h*), cf. Exercise 4.8.1. 


Nonlinear schemes 


Because the order of accuracy of the preceding schemes is higher than one, the 
order barrier Theorem 4.4.2 predicts that these schemes are not of positive 
type. This is immediately confirmed by a look at their stencils. Hence, the 
maximum principle of Theorem 4.4.1 does not apply, and spurious wiggles 
may occur. Of course it is desirable to combine higher order accuracy with 
the positive type property, in order to avoid spurious wiggles. To achieve this, 
a way around the order barrier theorem has to be found. This may be done 
by using nonlinear schemes, i.e. schemes that are nonlinear, ever though 
the differential equation that is approximated is linear. A great variety of 
nonlinear additions to linear schemes has been proposed in the pursuit of 
positivity. Many of these can be formulated in a unifying framework called 
the normalized variable diagram, which we will first discuss. A different but 
equivalent way to formulate nonlinear schemes of positive type is provided 
by the frequently used concept of flux limiting. This will also be discussed. 
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kappa = -1 kappa = 0 kappa = 0.33333 





0.5 1 


Fig. 4.9. Normalized variable diagram: f = f(¢). o : the point P = (1/2,3/4); 
---: «scheme; — : f(¢). 


Normalized variable diagram 


Point of departure is an upwind biased linear scheme, for which we take 
the «-scheme (4.89). For brevity we assume u; > 0; the contrary case may 


be handled by symmetry. In Gaskell and Lau (1988), Leonard (1988) the 
following normalized variable is introduced: 


~ p — ~j-1 
p=. 4.92) 
Pj+1— Pj-1 l 


From (4.89) it follows that for the «-scheme we have 


_ 2-K. l+k 
Piti = —y_¥5 + (4.93) 


The graph of $;41/2 versus %; is called in Leonard (1988) the characteristic 
of the scheme and can be plotted in the normalized variable diagram, see 
Fig. 4.9 for example. A general nonlinear scheme is defined by 


Pj41/2 = f(G;). (4.94) 
For the «-scheme, f is linear. 


If u; < 0 we get by symmetry: 
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~ z ~- P ¥j+2 
Piti = Fin), = m, (4.95) 
Pi — Pi+2 
and the function f(¢) remains the same as for u; > 0. 


The order of accuracy of the nonlinear scheme defined by (4.94) can be stud- 
ied as follows. Let u = constant. Then ~j41/2 — j~-1/2 has to approximate 
h;(dip/dzx);. Let the grid be uniform. In order to be able to analyze the local 
truncation error by Taylor expansion, f is approximated locally by, assuming 
that f° is bounded, 


He= 1) +-PFH+0(a-2) EEO 
One finds 
Pini Pinay = hldp/de); + (Pp/da”);{2F(5)- 5} (4.97) 
+h°(@p/dz®);{2 — (5) — sf GD + ORS). 


We see that the scheme is first order accurate for arbitrary fG), which justi- 
fies the choice made for the normalized variables, and the decision to develop 
f(%j) arround ġ; = 1/2 in (4.96). If 


f(1/2) = 3/4 (4.98) 


the scheme is second order. As expected, the «-scheme satisfies (4.98) for all 
K. Finally, if 


f (1/2) = 5/6 (4.99) 


the scheme is third order. As expected, the x = 1/3 scheme satisfies (4.99). 
In other words, the characteristics of second order schemes pass through the 
point P = (1/2,3/4) in the normalized variable diagram (Fig. 4.9), and for 
third order schemes are tangent in P to the characteristic of the « = 1/3 
scheme. 


Next, we derive further conditions to be satisfied by f. In order for the scheme 
to exclude spurious wiggles and to be more than first order accurate, f has to 
be nonlinear. We consider a number of possibilities for the values of p;—1, 9; 
and 41, see Fig. 4.10. If p; E [~j;-1, %j 41] (i.e. 0 < ğ; < 1) the distribution 
of values is monotone, and monotonicity is preserved if we choose ;41/2 E 
(yj, 541]. For f(%;) = 9341/2 this leads to 


Ži < f(g) <1, OSG <1. (4.100) 


fp; Æ [pj-1, 4; 41] the distribution of values is not monotone, and in order to 
steer the solution to monotonicity the first order upwind scheme 9; 41/2 = Pj 
is applied, resulting in 
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j-l jj j+1 


Fig. 4.10. Various values of y; in comparison to yj-1 and pj+1- 


f(Gj)) = Si, 9) q (0,1). (4.101) 


Finally, we remark that in the neglected O(h*) term in (4.97) there are terms 
proportional to f”, so that it seems advisable to keep f° bounded. Examples 
of nonlinear characteristics satisfying these criteria are given in Fig. 4.9. In 
the origin and in (1,1) f” is not bounded (since here the characteristic is 
continued with slope 1 according to (4.101)), but this is allowed, since here 
the scheme switches to first order upwind, so that lower order terms dominate 
in the expansion (4.97). Shortly, we will show that it is desirable to sharpen 
(4.100). 


The flux limiter 


A different way to formulate nonlinear schemes is as follows. In order to 


choose the flux Fe pij = Uj41/2Pj41/2 We may also write, instead of (4.94), 


1 
Pi+1/2 = Pj + 5 (ra) (Pi+4 — pj) . (4.102) 


Van Leer (1974)has proposed to make 7 a function of the ratio of consecutive 
gradients: 


_ ¥j — ¥j-1 
r; = =——. 
Pitl — Pij 
The function y(r) determines a correction on the upwind flux u;41/2~;, and 


is called the flux limiter. For reasons to become clear in Sect. 9.4, ~ will later 
also be called slope limiter. A survey of flux limiters (in the nonstationary 
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case) is given by Sweby (1984). The relation between the flux limiter y and 
the normalized variable function is as follows. We have 








Pi - rj 
r= ğġ= , (4.103 
7 1-9 , ) 


Equation (4.102) can be rewritten as 
~ ~ 1 ~ 
Pit1s2 = Pi + 5¥(rs)(1 — $5). 


Comparison with (4.94) and substitution of (4.103) gives the following rela- 
tion, suppressing indices: 


l 
FP) = dr +Ver), vlr) = AFG) - p/p). (4.104) 
From (4.104) follows that if f satisfies (4.100) and (4.101), then 7 satisfies 
0<v(r) <2, O0<r<oo; vi(r)=0, r<0. 


Flux limited schemes of positive type 


We will derive conditions for flux limited schemes to be of positive type. Let 
us consider the general case where u(x) may change sign. The scheme for the 
convection term is written as 


l cjJ +1/2 c 
Lnej = hy nave » Fea = (ue)jtiye- (4.105) 
If uj41/2 > 0, p;+1/2 18 given by (4.102). If uj41/2 < 0, upwind bias is 
applied in the other direction. By symmetry we obtain: 
1 1 
Pj+1/2 = Piti + 50(——)(¥5 Piti), Uj4iy2 <0. (4.106) 
200541 
The discretization in the case where uj;41/2 is arbitrary can now be written 
as 
j+1/2 = F} aj + Fiz1/2 , FR= (up*) 44/2 , (4.107) 
1 
PT2 = Pj +t 5 (rs) (Pi44 — ~j), Ujsts2>9, 
Pip =0, Ujtij2 <0, 
- 1 1 
Pi+iz2 = Pitt 3 Ne — Pj41), Ujsrs2 <0, 


Figi = 9, Ujyay 29, 


1 -;j+1/2 
Lrj = pE tF MEI- 
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In order to derive conditions on the limiter y(r) for scheme (4.107) to be of 
positive type we write it in the following form: 


Lag; = {aj (pjt — pi) + 55 (Yj-1 — G5) }/ hy (4.108) 
where a; and b; are functions of ;, 9341, Yj+2,.... We can write 
a: = Fi+1/2 ~ Peay ; Pi 41/2 7 Pi-1/2 
J a — ’ 
Pi+1/2 — Pj-1/2 Yjt1 ~ Pj 
+ + + + 
b: = Frau Fj-  Pj+1/2— Pj-1/2 
j = . 


it? jn 
Pi41/2  Pj-1/2 


Pi-1— Pj 


Note that dF~/dp~ < 0 and dFt+/dyt > 0, so that by the mean value 


theorem we have 


Piyaz 7 Fj- <0, 


Prstj2~ Pj-1/2 


+ + 
Fj41/2 7 Peay 


0. 
+ + 2 
Pi+i/2 7 Fj-1/2 


Comparison of (4.108) with Definition 4.4.1 shows that for the scheme to be 
of positive type it is necessary and sufficient that 


a; <0, 


It follows that we must have 





b <0. 


- - + + 
Fj+1/2 T Pj-1/2 ~ 0 Pj+1/2 ~ Pi-a/2 < 0. (4.109) 
Piti Pi Pj-1- Pi 
Using (4.102) and (4.106) we see that we must have 
2— (r+ >o, Ws ER. (4.110) 


This is necessary and sufficient. To have a;41/2 and 6;41/2 bounded we must 


also have 


Pj41/2 7 Pj-1/2 


+ + 
Pi+i/2 T Pj-1/2 


< M , > -M , 
Pj+1 — Pj ¥j-1~ Pj 
for some M > 0. This gives 
2—u(r)+ 2) com, Ws ER. (4.111) 
Combination of (4.110) and (4.111) gives 
-2< -v(r) + 2) coma, VrseR. (4.112) 
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If we require 
l1~M<v(r)<m<2, (4.113) 


then equation (4.112) gives 





m-2< W) cu 1. (4.114) 


Equations (4.113) and (4.114) imply that the graph of y(r) must lie in the 
shaded region of Fig. 4.11. For m = 2, M = 3 we obtain the first order TVD 





Fig. 4.11. Admissible region for graph of y(r). 


region derived by Sweby (1984) for the nonstationary case. 


The aim of flux-limiting is, of course, to obtain schemes of positive type of 
second (or higher) order of accuracy. For second order accuracy we must 
satisfy equation (4.98). Using (4.103) and (4.104) this gives 


w)=1. 


To have this point in the admissible region we must take M > 2. To make 
sure that the graph of y(r) passes through (1) = 1 we may formulate the 
following additional requirement: 


L> vr) >r, 1< vi) <r. (4.115) 


The results in the admissible region (shaded) of Fig. 4.12, which corresponds 
to the second order TVD region of Sweby (1984) if M = 3,m = 2. 
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Fig. 4.12. Admissible region for graph of y(r) for second order schemes. 


Normalized variable diagram for schemes of positive type 


Conditions to impose on flux limiters to obtain schemes of positive type are 
easily transported to the normalized variable framework by means of the 
correspondence relations (4.103), (4.104). The normalized variable diagram 
corresponding to the flux limiter diagram of Fig. 4.12 is obtained as follows. 
The interval 0 < r < œœ corresponds to 0 < ¢ < 1. The condition y < m < 2 
corresponds to 





m m 
5) < —+{(1-—)$. 
f(g) < 9 + ( z)” 
The condition ý% < (M — l)r corresponds to 
~n M+., 
Condition (4.115) corresponds to 
1 1 3 1 
L+ g> flp) > Ëg ae 
gta? 2fle)2 5%, OSS: 
l 1, ~ 3 1 
~4tig< Z 3 
3 +39 SIPS 5%, 5S) 


The resulting admissible region in the normalized variable diagram in the 
case m = 2, M = 3 is the interior of the dotted triangles in Fig. 4.9. As noted 
before (cf. equation (4.97)), characteristics of second order schemes have to 
pass through P = (1/2,3/4). For third order accuracy, the slope in P has to 
be equal to that of the K = 1/3 scheme, i.e. f (1/2) = 5/6. Finally, we want 
f to be continuous and f bounded on (0, 1). 
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The «-scheme with limiting 


A flux limited version of the «-scheme with f(¢) having the desired properties 
may be obtained as follows. Outside (0,1), f is given by (4.101). One [0,1] 
one could fit a third degree polynomial through (0,0), (1/2, 3/4) and (1,1) 
with f (1/2) equal to the slope of the x-scheme, i.e. 


f (1/2) =1-«/2. (4.116) 


However, it turns out that this results in f (0) > 2 for some values of «, 
which takes us outside the admissible region (dotted triangles of Fig. 4.9). 
This may also happen with a piecewise parabolic fit with a second degree 
polynomial through (0, 0) and (1/2, 3/4) satisfying (4.116). We therefore use 
a third degree polynomial on [0, 1/2), in order to control f (0). A parabola 
on (1/2, 1] is fine for x > 0, but gives an overshoot (f(%) > 1) for x < 0. In 
that case we use a third degree polynomial with f (1) = 0. This results in: 


f$) = % (0,1), 

f(p) = 26+ (K-1)p’ — 2G’, 0<G<1/2, 

FG) = 14 5K(B-1)+(w-IG-?, OS KS1, 2S G1, 
f(G) = 1-(L+ K)(G- 1)? -2(ġ- 1), -1<K<0, 1/2<G<1. 


The resulting characteristics are given in Fig. 4.9 for « = —1 (second order 
upwind scheme), k = 0 (Fromm scheme), « = 1/3 (third order upwind biased 
scheme), « = 1/2 (QUICK scheme) and « = 1 (second order central scheme). 


The corresponding limiter function is found to be, using (4.103) and (4.104), 





y(r) = o r<0, 
y(r) = + J z {1 + (3 — «)r +(2-—3x)r*}, 0<r<1, 
vr) = s+ (2 n)rh, r>1, 0<«<1, 
2 
vir) = (1+r) giter +r’ r>l, -l<K«<0. 


According to experience, somewhat better accuracy is obtained if the x-scheme 
(i.e. the dashed lines in Fig. 4.9) is used as much as possible. We there- 
fore choose the characteristic function f(g) piecewise linear, similar to the 
SMART scheme proposed in Gaskell and Lau (1988), and we will call it PL. 
The PL characteristic function is shown in Fig. 4.9. This type of character- 
istic function can be implemented very efficiently, as shown in Leonard and 
Drummond (1995), in the following way. The median function med(a, 6, c) 
takes three real numbers and returns the number that lies in between the 


4.8. More accurate discretization of the convection term 159 


other two or coincides with one of the other two. The median function is 
easily implemented as 


med(a, b,c) = max[min(a, b), min{max(a, b), c}]. 
From figure 4.9 it is clear that the graph of 
fi(G) = med{2¢, 1, >} 


is given by f1(¢) = ¢ for ø g (0,1) whereas otherwise the two upper sides 
of the triangle are followed. It follows that 


f(6) = med( f, (8), 53 + 4.9) 


gives the PL characteristic depicted in figure 4.9. Noting that d+med(a, b, c) = 
med(a + d, b + d,c +d) and dmed(a, b,c) = med(da, db, dc), we can switch 
from normalized to original variables for efficiency: 


fa(pj) = med(2y; — pj-1, Pj, Pj41) 5 
Yj41/2 med(f2(~j), 9341/2) Pj), 


tl 


k 1 l-k 
Phere = 3l tpit) +g lPi- t 2e = P+) - 


The corresponding PL limiter function is found to be, using (4.103) and 
(4.104): 


(r) = 0, r<0, 

(r) = 2r, 0<r<2/5, 

plr) = (2+r)/⁄3, 2/5<r<4, 

(r) = 2, r>4. 

Nonlinear characteristics for the QUICK scheme (x = 1/2) have been pro- 
posed in Leonard (1988), Gaskell and Lau (1988). Some well-known flux 


limiter functions are: 


van Albada, van Leer, and Roberts (1982): 








r?+r 
V(r) pi p2 +1 3 
Van Leer (1977b): 
_rtir| 
vr) = 1+ |r| ’ 


Roe (1986) (‘superbee’) 
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(r) = max{0, min(2r, 1), min(r, 2)} . 


With (4.103) and (4.104) the corresponding characteristic functions are found 
to be, respectively: 


van Albada: 
> 2 
~\_ ~ p- P 
van Leer: 
f(g) —_ 26- ğ’, ¢ € [0,1], 
f(y) = ĝ, ¢ g [0,1], 
Roe (superbee): 
f) = 28, OSEE, — 
IP) = 4049), beech, 
HP) = 3%) 9S <3, 
f(~) = 1, 30 9< 1, 
fig) = @ ¢ [0,1]. 


All of these characteristics are in the triangle of Fig. 4.9, and hence are 
wiggle-free and TVD. Also, they pass through (1/2, 3/4), but the second 
order accuracy of the ‘superbee’ flux limiter is in doubt, because f (1/2) 
is unbounded; cf. the assumption preceding (4.96). The ‘superbee’ scheme 
is known to suffer from ‘clipping’, i.e. near a maximum the numerical so- 
lution shows a plateau slightly below the exact maximum, and analogously 
for minima. Because f (1/2) does not exist, the ‘superbee’ scheme cannot be 
regarded as a nonlinear version of the «-scheme. For the van Albada and van 
Leer flux limiters we have f (1/2) = 1, corresponding to « = 0 (cf. (4.116), 
so that these may be regarded as flux limited versions of the Fromm scheme. 


Exercise 4.8.1. Let the grid be uniform. Determine Fiti? by interpolating 
Pj+1/2 to the highest possible order of accuracy between pj-1, p; and j41, 
and verify that this gives the « = 1/2 scheme. Determine the order of the local 
truncation error in the approximation of dy/dz for « = 1/2 and « = 1/3. 
Next, determine Fe asa by interpolation of 9341/2 to highest order between 
$j-11-++; Pj+2 Determine the order of the resulting local truncation error in 
the approximation of dy/dx. Do this also for (4.91), and compare. 


Exercise 4.8.2. The following flux limiter is mentioned in Roe (1986): 


y(r) = max{0, min(r,1)}. 
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Determine the corresponding characteristic f(~). Is the scheme of positive 
type? What is its order of accuracy? (Note that f (1/2) does not exist). The 
minmed function is defined as, 


z if |x| <|y| and ry>0, 
minmod(z,y) = 4 y if |z| > |y] and zy >0, 
0 if ry<0. 


Show that y(r) = minmod(1, r). This is why this is called the minmod limiter. 


Exercise 4.8.3. Verify that the PL limiter is inside the admissible region of 
Fig. 4.12. 


5. The nonstationary convection-diffusion 
equation 


5.1 Introduction 


The equation to be studied in this chapter is the nonstationary convection- 
diffusion equation: 


oF + tava —(Dea)a=% 2EQCR™, O<t<T (5.1) 


See Sect. 2.3 for initial and boundary conditions and further comments, and 
Theorem 2.4.3 for the maximum principle satisfied by (5.1). Discretization of 
the spatial derivatives has been discussed in the preceding chapters. In the 
present chapter, attention will be focussed on temporal discretization. As a 
consequence, numerical stability will loom large. As in the preceding chap- 
ter, the aim is not merely to discuss the numerical solution of the relatively 
simple equation (5.1), but to prepare the ground for the numerical solution 
of the equations of fluid dynamics. This will guide our selection of numerical 
methods to be discussed, from the plethora of schemes available. Further in- 
troduction to the subject may be found in Morton (1996), Strikwerda (1989). 


Accuracy requirements 


The choice of the temporal and spatial step sizes is governed by accuracy and 
stability considerations. In the ideal case accuracy and stability restrictions 
on the time step 7 should be about the same. If r is much more restricted by 
stability than by accuracy, efficiency may be gained by switching to a possibly 
more expensive but more stable time discretization method, and vice-versa. 
If 7 and La are the time and length scales in the z,-direction of the exact 
solution, then for accuracy it is required that the time step T and the spatial 
mesh sizes ha satisfy 


rs T/eo, he £ LajfCa (no summation). (5.2) 


How large co, ..., €m should be depends on the accuracy required and the order 
of the discretization. If the global truncation error is O(r?) then co can be 
smaller than when it is O(r). It seems safe to assume that in practice one 
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often needs at least co, ..., €m © 10. Later, on various occasions estimates of 
T and La will be presented. 


5.2 Example of instability 


Let m = 1, u = q = 0, D=1 and 2 = R. Let the spatial grid be 
uniform with mesh size h, and let 7 be the time step, kept constant. Let h 
be the numerical approximation to (nr, jh), with y(t, x) the exact solution. 
Equation (5.1) is discretized as follows: 


CRA E R T =0, (5.3) 
which may be rewritten as 


prt! — ages + (1 — 2a) yy ~} api 1 a= r/h? . (5.4) 


Let the initial condition be given by 
p(0, £) =ecosmz/h, (5.5) 


with 0 < £ < 1. The exact solution is 


= n’t/h? 


p(t, r) = £e cosmz/h. 


The initial condition for (5.4) is 
PS = e(—1) ’ 
and the solution of (5.4) is found to be 


y? = e(l — 4a)” (~1¥ . 
Hence, if a > 1/2, |~?| > œ% as n > oo, quite in contrast with the exact 
solution. The error e? = (nr, jh) — p7 is also unbounded for finite fixed 
time T = nr, since 

ej = fe T/™ _ (1 — 4a)? IT}(-1p , (5.6) 
which is unbounded as 7 | 0, A | 0 with a > 1/2. One might think that 
with more smooth initial data scheme (5.3) may perhaps provide a good 
approximation of the exact solution with a > 1/2, but because in practice 
rounding errors always give perturbations of a rough sort, a > 1/2 invariably 
gives bad results. 


For 0 < a < 1/2 we have 


5.2. Example of instability 165 


(1-—4a)7/" = exp{ in(i — 4a)} = exp{=(~4a — 8a? —...)} 
= exp{—35 (1 +2a+...)}. 


Substitution in (5.6) shows that the numerical solution has a wrong decay 
rate, so that the error e7/7, though now bounded, does not vanish upon mesh 
refinement. This does not mean that scheme (5.3) is not good. The cause of 
non-convergence is the extreme roughness of the initial condition (5.5). To 
show this we now present a proof of convergence for the present simple case. 
Let the initial condition be general but independent of h. Substitution of the 
exact solution in (5.4) gives 


p{(n+ tr ih} = apin (i+ YA} +(L—2a)e(r,jh) (5:7) 
+ ap{r,(j-1)h} +77, 
with the local truncation error 7? (not to be confused with the time step 7) 


j 
given by 
2 t* 4 x 
alpea) 1 ptt) 


oe O a O B ôA 
where nr < t* < (n+ 1)7 and (j — 1)h < 2” < (j+ Dh. Subtraction of (5.7) 
and (5.4) gives 


(5.8) 


ent} = aej4, + (1 — ale? + aef_, +7). 
If a < 1/2 we have 

le" < He" + le", (5.9) 
with ||- || the maximum norm. According to (5.8), 


IIn"I| < (Cir + Cah’) , 


with 


Q 
| 


1 
= sup{5|0°p/dt"| 0 <t <T, z ER} 


S 
| 


1 
= sup{ 7510" ¢/d2"| :0<t<T, zeER} 
Repeated application of (5.9) gives, assuming e} = 0, 


lle! < T(Cir + Cah?) 


so that we have convergence: the error can be made arbitrary small by mesh 
refinement, provided a < 1/2. We also see what goes wrong with initial 
condition (5.5): Cz = O(1/h4). 
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5.3 Stability definitions 


For a more extensive treatment of stability theory than given here, see Richt- 
myer and Morton (1967). 


Notation 


Let the interval (0,7) be divided in N equal subintervals of length 7. By 
y? we mean the numerical approximation to y(t,#) at t = nr in the grid 
point æ;, with j = (j1,...;Jm) grid point indices in the spatial grid Gh C £2, 
as before. The subscript A indicates, that we will consider a family of grids 
depending on the parameter h € R. Usually, no confusion will arise from using 
the symbol y both for the exact and numerical solution, but if necessary we 
will be more specific, Let y stand for the algebraic vector with elements ,;, 
ordered in some way. We will consider numerical schemes that can be written 
as 


(Byy"*"); = (Boe"); + (B2q”); + (Bsq"**); , (5.10) 


with Bo, By, ...B3 linear operators (that if we wish may be represented by ma- 
trices). Schemes that use more than two time levels will be considered later. 
Of course, B; is assumed to be nonsingular. Among other things, Bo, + , B3 
depend on the spatial mesh sizes hy and the time step T. Let these be de- 
creasing functions of a parameter h > 0, equal to zero for h = 0. We rewrite 
equation (5.10) as 


pith = (Co(h)y"); + (C2(h)e"); + (Ca(h)er™); 5 (5.11) 


with Ck = By’ By. 


Local truncation error 


As before (Sect. 3.7) the local truncation error is defined as the result ob- 
tained by applying the discrete scheme to the global truncation error (exact 
minus numerical solution). Since the numerical solution satisfies the scheme 
(disregarding rounding error) we obtain: 


Definition 5.3.1. Local truncation error 
The local truncation error of the discrete scheme (5.11) is 


Tj = y((n + 1)r,#j) — Colh)y(nr, ej) — Ca(h)qg? — C3(h)g t! , 


with ọ the exact solution of (5.1). 
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Consistency and convergence 


Let (5.11) be an approximation of (5.1) times r°|02; |? with |3 2;| the volume 
of some cell around z;. Then consistency is defined similar to definition 3.7.2: 


Definition 5.3.2. Consistency 
The scheme (5.11) is called consistent if 


lim tf /(r*|0Qj") =0, j€Gr, 1<n<T/r. 


This implies that the maximum norm is used to measure 7”, and the same 
difficulty crops up as in Sect. 3.7, namely that if a weaker norm is used 
an inconsistent scheme may get reclassified as consistent, and may still be 
convergent. However, we will study stability only on uniform grids, where 
this difficulty does not occur. 


Similar to Definiton 3.7.3, we define 


Definition 5.3.3. Global truncation error 
The global truncation error is defined as 


ef = plint, æj) 97, JEGnr, l<n<T/r. 


Furthermore, we define 


Definition 5.3.4. Convergence 
A scheme is called convergent if the global truncation error satisfies 
Tir 


lime; 


imej, = 0, æj, fixed. 


Here it is assumed that 7 and the spatial mesh sizes belong to a decreasing 
sequence such that T/r is an integer and æ;, is a grid point in the spatial grid. 
Convergence means that the exact solution can be approximated arbitrarily 
closely at a fixed time and position by decreasing 7 and refining the grid. 


Stability definitions 


As we saw in the preceding section, consistency does not imply convergence. 
In addition, stability is required. Let 6° be a perturbation of p°. The resulting 
perturbation of y” is called 6”. From (5.11) it follows that 


6"! — Co(h)d” . (5.12) 


Let ||- ||, be some norm for functions Ga — R. Stability means that ô” 
remains bounded as n — 00, for all 6°. Two useful definitions are: 


168 5. The nonstationary convection-diffusion equation 


Definition 5.3.5. Zero-stability 
A scheme is called zero-stable if there exists a function C(T) and a function 
To(h) such that 


or Wn < CITIS? la (5.13) 


for all r < 7o(h) and all h < ho for some fixed ho. 


The appellation ”zero-stability” refers to the fact that the limit h | 0 is 
considered. 


Definition 5.3.6. Absolute stability 
A scheme is called absolutely stable if there exists a constant C and a function 
To(h) such that 


l8” lla < ClO (5.14) 


for h fixed, all n > 0 and all 7 < ro(h). 
The difference with zero-stability is that here A is fixed. 


Lax’s equivalence theorem 


Definition 5.3.5 considers the perturbation at a fixed time T as h | 0, which 
is the same limit as in the definition of convergence. It can be shown that 
convergence implies zero-stability, and zero-stability plus consistency imply 
convergence. This is known as Laz’s equivalence theorem (cf. Richtmyer and 
Morton (1967)). But absolute stability is also good to have, because it allows 
n to grow indefinitely, making it possible to continue time stepping until 
a steady state is reached. Absolute and zero-stability are not completely 
equivalent. 


A remark on stability analysis 


The purpose of stability analysis is to find a suitable function 7 9(h) such that 
(5.13) and (5.14) hold. This is the case if the linear operator in (5.12) satisfies 


CoA) <C. 


In the case of absolute stability, h and hence the dimensions of the matrix 
Co are fixed, and linear algebra can be used to find conditions under which 
||[Co (h)|| is bounded as n > oo. But in the case of zero-stability, n — oo 
and A | 0 simultaneously, and we have to study not just the behavior of a 
linear operator, but of a family of linear operators. The dimensions of the 
matrix increase as n — oo. This is not a familiar situation in linear algebra. 
We will see that Fourier analysis is well-suited to the study of both kinds of 
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stability, if the boundary conditions are periodic. For stability analysis for 
other kinds of boundary conditions, see Hindmarsh, Gresho, and Griffiths 
(1984) and Richtmyer and Morton (1967). 


Choice of norms 


The stability of a scheme depends on the norm. The /, norm is defined as 


Sp 


1 
È DH, 1<p<o, 
JEG, 


llôllo = max{|ő;|: j E Ga}, 


where N is the number of grid-points in Ga. In the hyperbolic case (D = 0) 
there are schemes that are stable for p = 2 but instable for every other 
p € [1, 00), cf. Richtmyer and Morton (1967). It can be shown that /?-stability 
is always necessary (Geveci (1982)), otherwise the calculation blows up due 
to catastrophic growth of rounding errors. We will see that precisely for p = 2 
stability is easily studied with Fourier analysis. 


Exercise 5.3.1. Consider (5.1). Let m = 1, ui = 0, D = constant. Choose 
the following time discretization: 


prtt g" — rD(wyt! + (1—w)e') = wrt + (1—w)rq” . 


Let the spatial mesh size h = constant. Show that space discretization with 
the finite volume method results in 


h(t — 3) + Lai(wgr t +(1—w)y?) = whrqet? +(1-—w)hra} , 
where 
Lave; = (—pj-1 + 24; — pj-1)rD/h . 


Show that the local truncation error satisfies 


h 1 Ap n ap n 
TP = geU -og 2)) oga e) 
1 ð? x 1 * k 
+ air SE Ce) - sh Deanta") 


with t”? < t*,t** < ttl and xj_1 < 2* < 2541. Note that with w = 1/2 + O(r) 
we have improved temporal accuracy. Determine a and 8 according to Defi- 
nition 5.3.2. 
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5.4 The discrete maximum principle 


As we shall see, the discrete maximum principle is sufficient (but not neces- 
sary) for stability. Define 


Lilt! p") = Biet! — Boy” , (5.15) 
with Bo and By, as in (5.10). Let J denote the set of grid point indices, i.e. 
[= {j :j = (j1, nn jm), Ja = 0, lyas Na -l, a= Ly wa, m} ` 
Let in grid point j € I equation (5.15) be equivalent with 
Lalor gp) = $O (ange tt — Bae eye) » (5.16) 
kek 


with k = (k1,...,km) and K some set that follows from the stencil of the 
discretization. Let Ip C I be the set of boundary points where a Dirichlet 
condition is applied. In the cell-centered case, where there are no points on 
the boundary, we extend J with the (fractional) Dirichlet boundary point 
indices. Quite similar to Definition 4.4.1, we define 


Definition 5.4.1. The operator Lp is a of positive type if 


2 (ajk — Bie) =0, jE NID, 
keK (5.17) 
aj, <0, k#0 and B,>0, KEK, jEel\Ip. 


Note that it is the coefficient of pet that has to be positive. 


Theorem 5.4.1. Discrete mazimum principle 
If La is of positive type and 


Lilgett, yg?) <0, jep, n=1,2,..., (5.18) 
then local mazima can occur only for j € Ip and n = 0, or 
pr =p" t=... = p? = constant. 


This is quite analogous to the maximum principle for the differential equation 
(Theorem 2.4.3). The proof is similar to that of Theorem 4.4.1. 


For the perturbation 6” in the stability definitions of the preceding section 
we have 


L,(62*?,6")=0, jel. (5.19) 


Note that 6? = 0, 7 € Ip. If Lp is of positive type we have both a maximum 
and a minimum principle, because the right-hand-side of (5.19) is zero, and 
by changing the sign, Theorem 5.4.1 gives a minimum principle. Hence 
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[37| < max{|6;|: j € I}, 
so that 
l8” llo < I’ll - 


The following theorem follows: 


Theorem 5.4.2. If La is of positive type, then the scheme (5.10) is both 
zero-stable and absolutely stable in the marimum norm. 


It will be seen later, that the positive type property is not necessary for sta- 
bility. Multistep schemes (using more than two time levels for discretization 
in time) never have the positive type property, if their temporal accuracy is 
O(r?) or better. 


5.5 Fourier stability analysis 


In general it is difficult to derive estimates like (5.13) and (5.14). But if the 
coefficients in the scheme are constant, the mesh uniform, the grid a rect- 
angular block and the boundary conditions periodic, then Fourier analysis 
applies and the required estimates are not difficult to obtain. 


In practice, of course, the coefficients are usually not constant. The scheme 
is called locally stable if we have stability for the constant coefficients scheme 
that results from taking local values of the coefficients, and to assign these 
values to the coefficients in the whole domain (frozen coefficients method). 
In Richtmyer and Morton (1967), Chapt. 5 stability theory for schemes with 
variable coefficients is discussed. Local stability in the whole domain is nec- 
essary for stability in the variable coefficient case. Sufficient conditions are 
discussed in Richtmyer and Morton (1967). Quite often, it is sufficient that 
the scheme is dissipative, see Definition 9.3.1. We will discuss stability only 
for constant coefficients. 


Stability theory for non-periodic boundary conditions is complicated. But 
for explicit time stepping schemes it takes a while before the influence of 
the boundary conditions makes itself felt in the interior, so that Fourier sta- 
bility theory applies during a certain initial time span. As a consequence, 
stability with periodic boundary conditions is desirable, even if the bound- 
ary conditions are of different type. For stability analysis in the presence of 
non-periodic boundary conditions, see Richtmyer and Morton (1967). 


In the remainder of this section we present the basic principles of Fourier 
stability analysis. In the next section, application is made to the convection- 
diffusion equation. 
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Discrete Fourier transform 
Let I denote the set of grid point indices, i.e. 
I= {j:j= (Jjresjmh) Ja=0,1,..,na-1, a =1,...,m}. 
Furthermore, define the set O of wavenumbers as 
O = {0:0 = (b1, ba), Oa = 2tky/No , 
ka = Ma; —Ma + 1l, -Ma Pa, 0 = 1, oo Mm}, 


where py = 0, Ma = (Nna — 1)/2 for na odd and pa = 1, Ma = Nna/2 — 1 for 
Na even. Define 


G0 = Y jaba. 
azi 


We have 


Theorem 5.5.1. Discrete Fourier transform in m dimensions. 
Every grid function p : I => R can be written as 


p= a, (5.20) 
T3: 
with 
ce = NT! Y ee, N= II Ney + 
gel a=1 
The proof is elementary, and can be found, for example, in Wesseling (1992). 


A consequence of the preceding theorem is 
X co =0, VIET > co=0, VEO. (5.21) 
to) 
The /2-norm is defined by 
lel = N$ (es), lel] = NHF Jeo??? 
jel PEO 
We will need 


Theorem 5.5.2. Parseval. 
If p and c are related by (5.20), then in the ly-norm 


lell = N*?]]el 
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The periodic constant coefficients initial value problem 


If the grid is uniform, the coefficients in (5.1) are constant and the boundary 
conditions are periodic, then a typical equation of the system (5.12) can be 
written as 


So oah = Do eS, GET. (5.22) 
kek keK 


Because of the periodic boundary conditions, 6" and equation (5.22) can be 
extended across the boundary of J. Write 


=) nije 
ô; = CoE . 


GEO 


Substitution in (5.22) gives 


jjo 1 ike ike 
) eI? cht J Ope" — > Bre’ | =0. 


GE@ kek kek 


Because of (5.21) the term between brackets is zero, so that 


with the amplification factor g(@) defined as 


9(8) = (X Ar’) /(X are’). (5.23) 


kek kek 


The von Neumann condition 


Since c? = g” (0)c9, 


lle" i] < g°llc°ll, g= max{|g(8)| : VO}, 


with the l2-norm. Using Theorem 5.5.2, we see that a sufficient condition for 
zero-stability is that there exists a constant C such that 


GTI <C (5.24) 
for 0 < 7 < m(hA). Since 
CTT = exp(= In C)=1+O0(r), (5.25) 
we may write 


g<14+O0(r). (5.26) 


174 5. The nonstationary convection-diffusion equation 


This is the von Neumann condition for zero-stability. This condition is also 
necessary, because if g > 1+ p, > 0, then there is a 8 with |g(@)| = 1 + u. 
Choosing ô? = e? gives ||6"||/||4|| = (1+ x)”, n = T/r, which is unbounded 
as T | 0. Note that the O(7r) term (5.26) is not allowed to depend on A; this 
follows from (5.24)—(5.26). 


For absolute stability, 
a" <C, Yn, 
so that 
g<l 
is sufficient, and, by the same argument as before, also necessary. 


We will study a number of particular cases that are representative of fre- 
quently used schemes in computational fuid dynamics. 


5.6 Principles of von Neumann stability analysis 
Introduction 


The derivation of conditions that are sufficient or necessary for the von Neu- 
mann condition to hold for a particular numerical discretization scheme re- 
quires detailed and often tedious analysis of each particular scheme. It turns 
out that for many schemes used in practice only partial (for example, re- 
stricted to one spatial dimension) stability analysis results, or none at all, 
are available. Therefore we present below an approach that gives general re- 
sults for the full convection-diffusion equation in an arbitrary number of space 
dimensions. ‘The reader who wants to derive a stability condition will benefit 
from mastering this section. The outline is as follows. First, the symbol or 
Fourier transform of the spatial discretization scheme is introduced. Stability 
of an associated time stepping scheme is shown to be equivalent to a certain 
geometric condition on the symbol, namely that it must lie in a certain re- 
gion of the complex plane, called the stability domain of the time stepping 
scheme. In Sect. 5.7 we derive a number of universal properties of the sym- 
bol. Complexity due to multi dimensions is mastered by using the Schwartz 
inequality. These universal properties make it easy to obtain in Sect. 5.8 
stability conditions for particular cases that are important in computational 
fluid dynamics. 


Spatial discretization 


The convection-diffusion equation in m dimensions is given by 
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Op _ B m ð a? 
OE +Lp=0, Le= d (Maas -= Dani)? . (5.27) 


For Fourier stability analysis, u, and D are taken constant, the domain is 
the unit cube and the boundary conditions are periodic. With the «-scheme, 


the spatial discretization of L on a uniform grid with mesh sizes hy,..., Am 1s 
given by TZ, = Ch + Da, with 
1 
Ch = 7d peal (1 ~ K)Pj-2ea — (5 — 3K) Pje (5.28) 


ie 


+(3 — 3K) pj + (L+K)¢j+e.}5 


and 
1 
D, = 9 5 do(—Pj-eq + 29; — Pjtea) » (5.29) 


where j = (ji,---) jm), €1 = (1,0,...,0), e2 = (0,1,0,...,0) etc., and 
Ca = lUglt/he, da =2Dr/h? , (5.30) 


with 7 the time step to be used later in the discretization of Op/0t. In (5.28), 
Ua > 0 has been assumed; the contrary case may be treated by symme- 
try. The dimensionless numbers ca are called the CFL (Courant-Friedrichs- 
Lewy) numbers, after the authors of a pioneering paper published in 1928, 
and reprinted in English in Courant, Friedrichs, and Lewy (1967). The dimen- 
sionless numbers dą will be called diffusion numbers. The first order upwind 
and fourth order central schemes are given by, respectively, 


1 
Ch = 7 eale; — Pj-ea) 
and 


l 
Ch = For S CalPj-2ea — BPj-ea + 8Pj+ea — Pitzen)» (5.31) 


while D, is given by (5.29). 


The symbol 


After spatial discretization we are left with the following system of ordinary 
differential equations: 


dp;/dt = —Lnay; - (5.32) 
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The symbol or Fourier transform Ê, (0) of La is defined by 
Ln (0) =e"? Lp’, (5.33) 
One finds 


tLn(8) = Ca(8)+ Di), 
Ca(8) = y1(8) + iy2(0), Dy (8) = 6(8). 


For (5.28) we have, also if ua < 0, 


71(9) = 2(1~&) So cast, V20) =X ca{(1—K)sat+l}sinO,, (5.35) 


a 


(5.34) 


where sa = sin? toa, whereas for the fourth order central (or FOC for brevity) 
scheme (5.31) 


1 . 2 , 
yi(6) =0, y2(8) = 6 X ca(8sin ba — sin 26,,) = Xocal + 3a) sin Oa . 


ae ae 


Defining 
Ča =(1—kK)eq {K-scheme), ¢,=0 (FOC scheme) 
we see that for both schemes 


41 (8) = 2S čas? 42 (0) = So caf(1 — K)5Sa + l}sinĝa , (5.36) 


a 


provided we define « = 1/3 for the FOC scheme. For all schemes, 
5(9) =25 dasa. (5.37) 


The symbol of the first order upwind scheme is obtained by taking x = 1 and 
replacing da by da +ca. Therefore this scheme does not need to be considered 
explicitly in the sequel. 


Stability 


Following the definition of stability in Sect. 5.3, a perturbation 6; (0) is added 
to the initial condition of (5.32). Because of linearity, the equation for the 
perturbed solution is the same as for the unperturbed solution, and it is 
convenient not to change notation, but to denote the perturbation by y;(t). 
According to Sect. 5.5 we may write 


pit) = Sd) p. 


GEO 
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Substitution in (5.32) gives 


S {dgo/dt + Ly (Ope }e?? = 0. 
TE 


From (5.21) it follows that 
dôs/dt = —Ln(8)ge - (5.38) 


According to Parseval’s theorem 5.5.2, ||(t)|| = N4/?||yo(t)||. Hence (5.13) 
implies 


A(T) < CIE) 


For this it is sufficient that 


\Pa(T)| < C(T)|Ge(0)|, VOEO. (5.39) 


This also necessary, because the initial perturbation ¢;(0) is arbitrary, and 
one may take ;(8) = ĝe"? , giving equality in (5.39). Since the exact solu- 
tion of (5.27) does not grow in time (cf. Exercise (5.6.1)) we take C(T) = 1 
(allowing C(T) > 1 relaxes stability restrictions on 7 in a negligible way). 
Hence we require that after discretizing (5.32) in time the resulting solution 
satisfies 


Se) < il, Vaeo. 


The stability domain of numerical methods for the differential equation 
dy/dt = Ay, A € C (with C the complex plane) is defined as 


S= {rr eC: |y t| < jy"]}. (5.40) 


Comparison with (5.38) shows that for von Neumann stability it is necessary 
and sufficient that —rZ,(0) € S, YVO € O. In order to facilitate the analysis 
we sharpen this condition just a little by requiring it to hold for all 8. This 
leads to the stability condition that we will use: 


St CS, Sp ={-rL,(0): YO}. (5.41) 


This completes our discussion of the basics of von Neumann stability analysis. 
In order to deduce practical stability conditions in two and three dimensions 
from the basic condition (5.41), we will have to go into some technical details 
in the next two sections. 


Exercise 5.6.1. Determine the symbol of the operator L in equation (5.27). 
Show that solutions of (5.27) on an unbounded domain do not grow in time. 
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5.7 Useful properties of the symbol 
Preliminaries 


Since Sz does not depend on the time discretization to be used, it is useful to 
study some of its properties before investigating stability of time discretiza- 
tions by means of (5.41). 


Let da = da + Ča and d = © da. Then for both the «- and the FOC scheme 


we may write, since 0 < sa < l, 
(ô + 41)? = {> da Sq + Eqs}? < 4{S dasa}? : 
a 
Noting that da > 0, writing dasa = V dalsa y da) and using the Schwartz 


inequality (which we will apply frequently) : |a -b| < |ja|||[6|] for a,b € R”, 
we obtain: 


(6+)? <4d Y das? . (5.42) 


Furthermore, for both schemes, writing k = (1 — «), we have 


92 = [X (ca/Vda){Vda(Rsa +1) sin 6,}]? . 


a 


Applying Schwarz we obtain 
V <4> 2 /da >. dasa(l — Sa)(RSa +1). (5.43) 
Noting that |s,| < 1 it follows that 
y3 <4(2—K)? So cr /da X dasa(l — Sa) - (5.44) 


Furthermore, 


vf = (Pas) {ch d} (Rsa + 1) sin Ba 


ie 


Applying Schwarz gives 
V S {X (e4/da) PPD (c/d) {dl ksa + 1)? sin? 6a}? 
Another application of Schwartz results in 


12 < 16{) (ch /da) PF Y dalsa +1)*sQ(1 — 8a)’. (5.45) 
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Furthermore, for arbitrary a > 0 we have for both schemes, using (5.42): 


(+ - i) =14 5 Y duet -IX daSa + Čas? 


If d < a this gives 


(i i) <1+- a È dals 2 sa). (5.46) 


a 


Theorems on Sz 


Next, we present some theorems on the location of Sz in the complex plane 
C. The usefulness of these theorems will becorne apparent later. 


Theorem 5.7.1. (Rectangle) 
If 


i<a/2 and Ñ` cq <b/(2- xK) (5.47) 


then S; is contained in the rectangle defined by 
~a<v<0, jul <b, v+iw=z. 
The first condition is necessary. 
Proof. We have to show that —rL,(0) € rectangle, V0, or 
0<d+71<a and |y2|< 5, V8. (5.48) 
We have 


8 +y = 2Y (dasa + Gass) - (5.49) 


ie 


Hence 0 < + yı < 2d and the first part of (5.48) follows. Necessity of the 
first condition of (5.47) follows by taking s, = 1. Furthermore, 


val < So ca{(1 — K)sa + 1}[sinOg| < (2-6) Soca <b. 


a 


a 


The conditions (5.47) are equivalent to the following time step restriction for 
the «-scheme: 


+ <minf 5 sp! ha (2+ (1 —£)pa), 5! E llual/ha)} (5.50) 
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where the mesh Péclet number pg is defined as 
Pa = |Ualha/D. (5.51) 
For the FOC scheme we obtain 
. fa _2 3 
r< min 75/2 | 50/2 lual/H} | (5.52) 
whereas the condition for the first order upwind scheme is 
: a -2 
r< min op a he (2+ pa), 6/ Si Iual/ha) } . (5.53) 
Theorem 5.7.2. (Ellipse) 
If 
d<a and X È /da < (2—K«)7*b*/a (5.54) 


then Sr is contained in the ellipse 
(v/a+1)? + (w/b)? =1, v+tiw=z, a>0. 
The first condition is necessary. 


2 
Proof. We have to show that (see — 1) + (2)? < 1. Necessity of the 


first condition follows by using (5.49) and taking sy = 1. Continuing with 
sufficiency, using (5.44), (5.46) and (5.54) we obtain 


b+ (my? 4 2 2 
(=x -1) + (3) S147 dala — sa + sa = 84) <1. 


m 


For « = 1, necessity of both conditions is shown in Wesseling (1995). The 
conditions (5.54) are equivalent to the following time step restrictions: for 
the «-scheme: 


a 2 
r< ming 5/70 +(1—*)pa), 2D- 1) >u}. (6.58) 


For the FOC scheme: 


T< min{ s/h, B Ea. (5.56) 


For the first order upwind scheme: 


a 2 
T< min{ 5/ Doha" + Pa), 2D—/ iva} . (5.57) 
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Theorem 5.7.3. (Oval) 


If 
d<a (5.58) 
and one or both of the two following conditions hold: 
D (cã/da)? < qa (b/a), Sica < q2b?/a, (5.59) 
where 


qa = (8 — 4«)~°/3(15 — 5r — r) (5r -3 +r) P(O- Tkr), 
r = (25k? — 54K +33), 
q2 = (1—4«)9/*(8/5 — 4/5), —1<k<1/2, 
q2 = ta-w), 1/2<K <1, 
then Sr is contained in the oval given by 
(v/a +1} + (w/b) =1, v+iw=z, a>0. 
Condition (5.58) is necessary. 


Proof. Necessity of (5.58) follows by taking s, = 1. Continuing with suffi- 
ciency, assume that (5.58) and the first condition of (5.59) hold. Using (5.45) 
and (5.46) we obtain 


ô + y1 ? ny’ 
(22-1) (2< 
4 
1+- So da(sa — 84){—1 +44f (Rsa + 1)*sa(1 — sa)}. (5.60) 
a ~ 
Since 


max{(ks+1)*s(1—s): 0< 8< 1}= 1/4], 


no term in the sum in equation (5.60) is positive, so that Sz C oval. Next, 
assume that (5.58) and the second condition of (5.59) hold. Because of (5.58), 
0 < +71 < 2a, hence, with č = 4° é, and using Schwartz, 


ð +y t 6 (64m n yı 2 
(2-1) = 2 (B+ Ba) 4 (2-1) 
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Furthermore, similar to (5.45), 


meet ča) yey Deal Ksa + 1)4s2(1— sa)? 
Dea Daal KSq + 1)4 s? (1 — sa )?. 

Hence, using the second condition of (5.59), 
ô +y ? y2\4 
(=> 7 1) +(7) s 


4 2 
1+ a least (1 — sa) 2 (ksa + 1)4(1 — sa) - 1 — sa}. 


13 


IA 


Hl 


Observing that 
max{(ks+1)*(1—s)/(l+s):0<s<1,0<k<1/2}=1 
and that 
max{(ks + 1)*(1—s): 0<s <1, 1/2< R < 2} =k?/4q2 
we see that each term in the preceding sum is non-positive. o 


Note that the first condition of (5.59) is not useful in the hyperbolic case 
(D = 0, i.e. da = 0), whereas the second condition is not useful for «x = 1; 
the two conditions complement each other. 


The time step restrictions equivalent to (5.58) and (5.59) are: for the «- 
scheme: 


TS Eha (2+(1—«)pe) and 





4 1/3 -1/3 2 (5.61) 
r< maxfa (22E) (Zuty) > ante /TUlual/ha)} 
For the FOC scheme: 
T< ap/ hz? and 
(5.62) 


_ (opet\ 9 (ag a\ app 
r < max} qr a ye ug he, ’ AP /X(lual/ha)} , 


a 


where ĝi = } (54412) (VTO — 1)5/3 = 0.6360. 
For the first order upwind scheme: 


r< max{ 5 DAE + Pa) (2E) En). (5.63) 
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Theorem 5.7.4. (Half ellipse) 


If 
~ a od 9 ~ -2 X 
d < 3 and p2 cada < (2 — K) (1 + 1—- 4d? fa?) ; (5.64) 


a 


then Sr is contained in the left half of the ellipse given by 
(v/a)? + (w/b)? =1, v+iw=z. 
The first condition is necessary. 


Proof. That —rL,,(0) is not in the right half of the ellipse follows from ô + 
yı > 0. Necessity of the first condition follows as before. Continuing with 
sufficiency, using (5.42), (5.44) (which also holds with da replaced by da) 
and (5.64) we have 


l 
(+2) 4 (2y < 1a È dasalsa + p(1— sa)} , 


a 


where p = (a?/2d?)(1 + 4/1 — 4d?/a?). Since p > 2, max{s(s + p(1 — 8)) : 
0< s< 1} = pP {4(p-1)} = a?/(4), so that (5+7)?/a? + (92/0)? < 1. 0 


The conditions (5.64) are equivalent to the following time step restrictions: 
for the «-scheme: 


in| -2 _ b — Ad? fq? 1/2 
r < min [a Latha (2+ (1 — kK)Pa), z— (+y! 4d /a?) 
a Ls (2+ (1 —K)pa) pai (24 (1 -— K)pa) t} u2], (5.65) 
For the FOC scheme: 
— Ad2 /q2\l2 -2 -1/2 
7 <min| a5 Dha’, Sb(1+ y1 4d? Ja?) [2 h X ua} |. 
(5.66) 
For the first order upwind scheme, with d = rD T hz (2+4 Pa), 


r < min[z5/ 22h (2 + Pa); s(14 yrz) 
ODO (2+ pa om (2+ pa)” Mal (5.67) 


These inequalities are implicit, because d depends on 7. However, checking the 
admissibility of a given 7 is straighforward, whereas generation of a suitable 
T is easily done by some iterative process. Furthermore, without losing much 


sharpness, the term 4/1 — 4d? Ja? may be replaced by 0. 
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Theorem 5.7.5. (Parabola) 


if 
Soc fda < gb? , (5.68) 
where 
q3 = L, ł LKZ l, 
q3 = Ẹ(1- k)(2 - K), -1 <k < 1/2, 


then Sr is contained in the parabola given by 
v+ (w/b)? =0, v+iw=z. 


Proof. We have to show that —ô — yı — (y2/b)* < 0. Using (5.43) and (5.68) 


we have 


—5 — yı + (y2/b)? < 2 X dasa{-1 + 293(1 — sa) (Rsa + 1)*}. 


Bij observing that 
max{(1—s)(ks+1)*: 0< s< 1} = 1/243 
the proof is completed. A 


Condition (5.68) is equivalent to the following time step restriction: for the 
K-scheme: 


T < 2Dq3b"/ X u}. (5.69) 


For the FOC scheme (5.69) holds with q3 = 435/53 = 0.4860. For the first 
order upwind scheme we obtain 


1 2 2 —1 
T< Db [Ds wal2+ pa) . (5.70) 


The preceding theorems (exept Theorem 5.7.1) have been put forward in 
Wesseling (1995), Wesseling (1996c), as has the method that will be followed 
in the next section to derive stability conditions. 


5.8 Derivation of von Neumann stability conditions 
We will now use the properties of Szr, derived in the preceding section to obtain 


von Neumann stability conditions for time discretizations that are frequently 
used in computational fluid dynamics, or that are of historical interest. After 
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space discretization according to (5.28), (5.29) we are left with the system 
of ordinary differential equations (5.32). A good source of information on 
numerical methods for ordinary differential equations is Hairer, Nørsett, and 
Wanner (1987), Hairer and Wanner (1991), where an introduction to stability 
domains may be found. 


All that remains to be done here is to choose suitable unions or intersections 
of the sets considered in the theorems of the preceding section (rectangle, el- 
lipses, oval, parabola) inside the stability domain S of the time discretization 
method used to discretize (5.32), and useful (i.e. not too conservative and 
easy to evaluate) sufficient conditions for von Neumann stability tumble like 
ripe apples. Most of the following results are not generally known or new. 
The techniques used to derive the preceding theorems can be also used to 
obtain stability conditions for cases where the rectangle, ellipses, oval and 
parabola do not fit nicely in S; an example (leapfrog-Euler) will be given. 


For understanding the principles of our approach to the derivation of von Neu- 
mann stability conditions it is not necessary to study every case presented 
below in detail. For understanding, it suffices to study the cases of Adams- 
Bashforth and leapfrog-Euler, for example. A number of other schemes are 
included because they are frequently used in practice, and the stability con- 
ditions presented are not widely known or new. This section may be regarded 
as a catalog of time stepping schemes and stability conditions. Many refer- 
ences will be given to publications where these schemes are applied to the 
Navier-Stokes equations. 


Explicit Euler 


This time discretization method when applied to (5.32) is given by 


ptl = p" =-rLyp”. 


The temporal local discretization error is O(r). After Fourier transformation 
this becomes 


pot — pp = zp, z= -rL (6). (5.71) 


Substitution of pọ = €” leads to the following characteristic equation: 
E- (1+4 z)=0. 


The left-hand side is called the stability polynomial. Non-growth of |p} | is 
equivalent to the root(s) of the stability polynomial to be inside the unit 
disk. Hence z must satisfy |z+1| < 1. From this follows the stability domain 
S of this method: 
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S={zeC: |z+1|<1}. (5.72) 


Now we apply stability condition (5.41). The ellipse of Theorem 5.7.2 with 
a = b = 1 fits perfectly in the disk (5.72), so that this theorem gives 


Theorem 5.8.1. (Stability of explicit Euler scheme) 
A sufficient condition for von Neumann stability of the explicit Euler scheme 
is 


d<1 and S53 /da <(2—K)~?. 


Corresponding time step conditions follow inmediately from (5.55)—(5.57). 
For « = 1 (second order central scheme) these conditions are also shown 
to be necessary in Wesseling (1995), and identical to the necessary and suffi- 
cient conditions obtained for this scheme in Hindmarsh, Gresho, and Griffiths 
(1984), Hirt (1968), Morton (1971). 


The contribution to the local discretization error due to temporal discretiza- 
tion is called the temporal discretization error. No temporal discretization 
error is made if dp/dt = —Lny, so that dy, /dt = poet", hence 


polt +1) = pelt) . (5.73) 


Substitution of y¢(t) in the numerical scheme (5.71) gives the temporal local 
discretization error 7): 


in = polte — 1-2) = O(2?/r) = Or), 


where we have scaled by a factor 1/7, because (5.71) approximates the Fourier 
transform of the differential equation times 7. 


Adams-Bashforth 


In order to obtain better temporal accuracy, one may include more time 
levels. This leads to multistep schemes. An example is the second order 
Adams-Bashforth (or AB2, for short) scheme, frequently used to solve the 
Navier-Stokes equations, for example in Andersson, Billdal, Eliasson, and 
Rizzi (1990), Balaras and Benocci (1994), Benocci and Pinelli (1990), Birin- 
gen and Cook (1988), Breuer and Rodi (1994), Gao (1994), Gavrilakis, Tsai, 
Voke, and Leslie (1986), Gerz, Schumann, and Elgobashi (1989), Hoffmann 
and Benocci (1994), Kajishima, Miyake, and Nishimoto (1990), Kobayashi 
and Kano (1986), Kobayashi, Morinishi, and Oh (1992), Kost, Bai, Mitra, 
and Fiebig (1992), Kost, Mitra, and Fiebig (1992), Krettenauer and Schu- 
mann (1992), Kristoffersen and Andersson (1993), Lé, Ryan, and Dang Tran 
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(1992), Moeng (1984), Schmidt and Schumann (1989), Shimomura (1991), 
Thomas and Williams (1994), Voke and Gao (1995). The scheme is given by 


ntl _ oan), (5.74) 


This being a three-time-level scheme, it is not self-starting. A start can be 
made by one explicit Euler step. Including one step with an O(7) scheme does 
not affect the global O(r?) accuracy. The first step may even be instable. 


The temporal accuracy is determined in a similar way as for Euler. Taking 
the Fourier transform of (5.74) gives 


1 
n+2 1 n+l n 
pyt- pyt! = 5 2(3p5 7 — ph). 


Substitution of the exact solution (5.73) and scaling by 1/7 gives 


1 3 i 
în = —po(t)(e* — e — Sze? + 52) = O(2?/r) = Olr’). 


In the same way as before, the stability polynomial is found to be: 
z 
£- E- 5 (36-1). 


For z € S the roots are inside the unit disk. On the boundary OS, one or more 
roots have modulus 1, and can be written as € = e'# for some p € [0, 27). 
Hence 0S can be found by solving 


eth _ ett = (3e™ —1)=0, 
resulting in 
z = 2e” (et — 1)/(3e** — 1) (5.75) 


When p is varied, z traverses 0S. The result is shown in Fig. 5.1. All stabil- 
ity domains to be encountered are symmetric with respect to the real axis, 
therefore only the upper part is shown. 


We will now fit an oval and an ellipse in S. We have |z| < 1 for |p| < 1. 
Expansion of (5.75) for small || gives 


z & (4ip — p*)/(10 — 6 cos p) =v + iw . 


Hence on ðS we have near the origin w ¥ +(—4v)!/4, v < 0. Near z = 0 we 
have on the oval of Theorem 5.7.3: w = +b(—2v/a)'/4. We choose a = 1/2 
and with b = 1 the oval and 0S osculate in z = 0. In order to bring the oval 
partly inside ðS we choose b < 1. With a = 1/2, b = 2-1/4 the oval in Fig. 5.1 
is obtained. Near z = —1 this oval is outside ðS. We do not wish to make the 
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-1 OS -08 -07 -06 -05 -04 -03 -02 -01 0 


Fig. 5.1. Stability domain S of second order Adams-Bashforth scheme (—) with 
oval (- - -) and ellipse (-- -) 


oval smaller, however, because this would make the oval deviate more from 0S 
near z = Q, resulting in an unnecessarily strict stability restriction. Staying 
close to 0S near z = 0 is of importance, since in convection dominated flows 
\Im{rL,(0)}| >> [Re{rL,(6)}|. Therefore we also select the half ellipse of 
Theorem 5.7.4. Near z = —1 we have p= r +€, |e] < 1 in (5.75), resulting 
in the following local behavior of 0S: 


z S —1 + (5e? + 12%) /16 = —1 +v + iw, w= +3yv/5, v> 0. 


On the ellipse of Theorem 5.7.4 we have near z = —1, choosing a = 1: z= 
—~l+v+iw with w = +by2v. The ellipse and 0S osculate in z = —1 for 
b = 3/vV10. In order to bring the ellipse partly inside 8S we choose b slightly 
smaller; a = 1, b = ,/2/3 results in the ellipse partly shown in Fig. 5.1. The 
ellipse and the oval intersect in z = —1/4+i,/5/8. Numerical solution of the 
roots 1,2 of the stability polynomial shows that in the point of intersection 
we have max{{£,|, |€2|} = 0.9934 < 1, so that {ellipse N oval} C S. Hence, 
using Theorems 5.7.3 and 5.7.4 we obtain 


Theorem 5.8.2. (Stability of second order Adams-Bashforth scheme) 
A sufficient condition for von Neumann stability of the second order Adams- 
Bashforth scheme is 


d<1/2 and dyve2/da < (8 — 4K)-?(1 + 4/1 - 4d?/a?) 


and {Z ct /da) 3 < qi or Y < gav2}. 


Corresponding time step restrictions follow easily from (5.61)-(5.63) and 
(5.65)—(5.67). 


Comparing the AB2 scheme with the explicit Euler scheme, we remark that 
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the accuracy of AB2 is O(r?), and that the computing work is almost the 
same. Straightforward implementation of AB2 requires storage of the solu- 
tion vector at an additional time level (although this may be economized 
upon, cf. Exercise 5.8.3). One might think that explicit Euler is less stability 
restricted than AB2, because S (the unit circle at z = —1) is rather larger 
than S for AB2 (see Fig. 5.1). However, near z = 0, |Im{7Ê,(0)}| is less 
restricted (OS stays closer to the imaginary axis, which we try to exploit 
expressly by the introduction of the oval) for AB2 than for Euler, which may 
for convection dominated cases result in weaker stability restrictions for AB2 
than for Euler (cf. Exercise 5.8.1). All things considered AB2 seems more 
attractive than explicit Euler. The AB2 scheme is in widespread use for com- 
puting nonstationary viscous flows, especially in the context of large-eddy 
and direct simulation of turbulence. The lack of rigorous stability conditions 
before 1995 does not seem to have hampered this development. 


The third order Adams-Bashforth (or AB3 for brevity) scheme is given by 


1 
yrts — pnt? = -yg Eh (23y"+? _ 16y"71 + 5) . 


The stability polynomial is found to be: 
1 
E-E- 757 (236" — 16€ +5). 


The boundary @S of the stability domain S is computed in the same way as for 
the AB2 scheme, and is displayed in Fig. 5.2. The stability domain is covered 








As -0.5 -0.4 -0.3 42 -0.1 0 0.1 


Fig. 5.2. Stability domain S of third order Adams-Bashforth scheme (—) with 
osculating ellipse (- - -) 


to a satisfactory extent by the half ellipse of Theorem 5.7.4 that osculates in 
z = —6/11; its parameters are found to be a = 6/11, b = 72 \/2/235 = 0.6038 
(cf. Exercise 5.8.4). Application of Theorem 5.7.4 gives 
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Theorem 5.8.3. (Stability of third order Adams-Bashforth scheme) 
A sufficient condition for von Neumann stability of the third order Adams- 
Bashforth scheme is 


d< = and dY` c? Jda < 0.1823(2— «)~2(1 + y1 — (11d/3)2) . 


The temporal discretization error is O{r*). A look at the stability domains 
of Figures 5.1 and 5.2 leads one to expect that for convection dominated 
flows (|Im(rZ,,(8))| large) the stability restrictions will not be much more 
restrictive than for the AB2 scheme. This expectation is borne out in the 
example of Exercise 5.8.5. Compared to AB2, the solution at an additional 
time level has to be stored. The scheme can be started at t = 0 by an explicit 
Euler step, followed by an AB2 step. This scheme seems to be as attractive 
as AB2, and is also used in practice. 


The BDF scheme 


This is an example of an implicit three-level scheme. BDF stands for back- 
ward differentiation scheme; the method is also called the second order Gear 
scheme. This method has been applied to the Navier-Stokes equations in, for 
example Belov, Martinelli, and Jameson (1994), Breuer and Hanel (1990), 
Dwyer, Soliman, and Hafez (1986), Ho and Lakshminarayana (1993), Rogers 
and Kwak (1990), Rogers, Kwak, and Kiris (1991}. The BDF scheme is given 
by 
4 1 


2 
prt? _ ze" + ze" — —3rlne t? ; 


The stability polynomial is 


2 2 4 1 
(1—32)6 — 36 +3: 


Similar to the Adams-Bashforth scheme, on the boundary ðS of the stability 
domain S we have 


1 . , 
z= se — 2e7T'# 4 ` . 


When p is varied, z traverses 0S. The result is shown in Fig. 5.3. Note that 
the contour is in the right half of the complex plane. Since the roots of the 
stability polynomial satisfy |€1,2| > 1 inside the contour (as can be seen by 
computing 1,2 in z = 2, for example), the stability domain S is in this case 
the exterior of 0S. We see that in this case making the scheme implicit has 
the intended effect of making S large. Since Re(—rL,(6)) < 0, the BDF 
scheme is unconditionally stable. 


5.8. Derivation of von Neumann stability conditions 191 





a ane 





1s] os 1 15 2 256 a 36 4 


Fig. 5.3. Stability domain of BDF scheme 


Leapfrog-Euler 


This method has been widely used for large-eddy simulation of turbulence, 
for example in Arnal and Friedrich (1992), Boersma, Eggels, Pourquié, 
and Nieuwstadt (1994), Deardorff (1970), Eggels, Unger, Weiss, Wester- 
weel, Adrian, Friedrich, and Nieuwstadt (1994), Friedrich and Unger (1991), 
Groétzbach (1982), Grötzbach and Wörner (1994), Manhart and Wengle 
(1994), Mason (1989), Schmitt and Friedrich (1988), Schumann (1975), Wag- 
ner and Friedrich (1994), Werner and Wengle (1989). The idea is to improve 
upon the explicit Euler method by improving the temporal accuracy of the 
convection term and allowing higher CFL numbers. This is done by time- 
discretizing the convection term with the leapfrog method, which has no nu- 
merical diffusion: both the symbol of the differential operator and the discrete 
approximation are purely imaginary for central schemes. Leapfrog cannot be 
applied to the diffusion term as well, because this results in instability; this is 
left to the reader to show in Exercise 5.8.6. Therefore explicit Euler is applied 
to the diffusion term. The resulting scheme is given by 


prt? _ e" = —2(Cay"*! + Day”). 


This is an example of a mized method: the time discretizations of the con- 
vection and diffusion terms are different. After Fourier transformation this 
becomes 


ppt? — ph = -2(Ca(O)ept! + Da(O)es) | 
or. using (5.34), 
yet? — vf = —2(y(O)—5** + 6(8) yp) , 


with y = yi + 7y2, and d (given by (5.37)) real. The stability polynomial is 
given by 
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£7426 426-1. (5.76) 


For mixed methods, the coefficients of the stability polynomial do not depend 
on a single complex parameter z, as in the preceding cases (where z = y+), 
so that the stability domain S is not longer a subset of the complex plane. In 
the case of (5.76), where 6 happens to be real, the roots depend on the three 
parameters y1, Y2 and ô, and S is a subset of R*. It would not be difficult 
to redevelop the theorems of the preceding section in a three-dimensional 
setting, but a visual check whether the sets of these theorems are contained 
in S is much less straightforward in three than in two dimensions. We will 
not do this, and restrict ourselves for mixed schemes to the case y1 = 0, 
i.e. central second order (x = 1) or fourth order (FOC) discretization of the 
convection term. Hence, the roots of the stability polynomial depend on two 
parameters only, ô and y2, and we can continue to work in the complex plane, 
with z = —d — 749. 


As before, on 0S we have, with z = v + iw 
e?” _ diwe'# —Qv-1=0, O<p<2n. 
By solving for v and w one obtains 


sin 2p — 2wcosp = 0, 
cos 2u + 2wsinpt-—2v=1, 


For p # 1/2, 37/2 this gives v = 0, w = sin u, whereas for p = 1/2, 37/2 
one obtains v + w = —1. The resulting stability domain S is presented in 
Fig. 5.4. The ellipses, oval and parabola of the theorems of the preceding 


-1 
Fig. 5.4. Stability domain S' of leapfrog-Euler scheme. 


section fit badly in S, and we proceed directly. 
Theorem 5.8.4. (Stability of leapfrog-Euler scheme) 


a)For the k = 1 scheme, the leapfrog-Euler scheme is von Neumann stable if 
and only if 
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X (da + Va? +02) <1. (5.77) 
aQ 


b) For the fourth order central scheme a sufficient condition for von Neumann 
stability of the leapfrog-Euler method is 


X (da + 4/42 + 8) <1. 


Proof. It is necessary and sufficient that 6 + |y2| < 1. For x = 1 this means 


S {dall — cos ĝa) + €a|sinBal} <1, Waa. 


Since 01, ..., 0m are independent, the maximum of the sum is obtained by 
maximizing each term. Defining f(@) = d{1 — cos 0) + esinð, O0<8 <m, 
we have f'(8) = 0 ford =r- y, tany=c/d, 0< y< 7/2, resulting in 
max{f(0): 0 <8 < m}=d+ vec? +d?, which concludes the proof of part a). 


Continuing with part b), for the FOC scheme a necessary and sufficient con- 
dition is 


X {da(1 — cos ĝa) + rals sin ĝa — sin 2ĝa|} < 1, Yôa. 


Since |8 sin ĝa — sin 28a | < 10| sin 8al it is sufficient if 


So {da(1 — cosa) + cal sin bal} <1, Va. 


Determining the maxima of the individual terms as before, the proof of part b) 
is easily completed. o 


The corresponding time step restriction is found to be 
r< {DS h3 (2+ V4 + q2), 


with q = 1 for the second order central (x = 1) scheme and q = 25/9 for the 
FOC scheme. 


The one-dimensional version of part a) of this theorem has appeared in Chan 
(1984), with a less elementary proof. The following stability condition for the 
k = 1 scheme has been put forward on heuristic grounds in Schumann (1975) 
and has been generally used in practice since: 


X (2da +ca) <1. (5.78) 


a 
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A proof (less elementary than above) has been given in Pourquié (1994). 
Condition (5.78) is stronger than (5.77), cf. Exercise 5.8.8. 


One of the roots of the stability polynomial (5.76), say £2, is spurious; £1 is 
called essential and approximates the exact amplification factor exp{—r L(0)}. 
Ideally, one should have |£2| < |£1| (this is the case for the multistep schemes 
discussed before), but for D = 0, i.e. d = 0, |2| = |€1|. The trouble this 
causes may be seen from the fact that for D = 0 the numerical scheme con- 
tains a spurious undamped eigenmode p? = (—1)"+71+--+J™ if we disregard 
the boundary conditions. Furthermore, on the part of 0S where v+|w| = 1 we 
have |£2| = 1 > [£1]. In order to damp spurious modes, users of the leapfrog- 
Euler scheme stay well within S and usually apply the so-called Asselin filter 
(Asselin (1972)). After y”+? has been computed, y”+! is modified according 
to 


1 
ei = (lope + zele +e), 


with typically c € [0.05, 0.1]. 


Because of these difficulties, and because the temporal accuracy is only O(r) 
(although if convection dominates practical results come close to O(r7)) users 
now tend to favor the Adams-Bashforth scheme. 


Adams-Bashforth-Euler 


Instead of leapfrog one can also combine second order Adams-Bashforth with 
explicit Euler, avoiding the trouble with the spurious root and the need for ar- 
tificial damping. This approach has been used for the Navier-Stokes equations 
in Le Thanh (1992), Le Thanh, Troff, and Ta Phuoc Loc (1991), Verstappen 
and Veldman (1994). The method is given by 

3 


1 
yprte _ yt — -Ca( 3P t _ 5%") _ Dip} i 


The stability polynomial is given by 
3 1 
? y +8- 1)- =y. 
Oot &(57 + )~ 57 


As before, this being a mixed scheme we restrict ourselves to the case that 
y = 772. The boundary 0S of the stability domain S is given by 


ent — e'¥(Siw +v +1) + ziw =, O< pda, z=v+iw. (5.79) 


The stability domain is plotted in Fig. 5.5. For comparison, the stability 
domain of the second order Adams-Bashforth scheme is also shown. Sufficient 
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Fig. 5.5. Stability domain S of Adams-Bashforth-Euler scheme (—) with oval 
(- - -) and ellipse (---), and S for second order Adams-Bashforth scheme (- - — - - ). 


stability conditions are obtained in a similar way as for the Adams-Bashforth 
method. On ðS, near z = 0, expansion of (5.79) for |u| < 1 shows that we 
have v = — 4/4, w = p— Žu, hence w =% +(—4v)!/4. Choosing a = 1, on 
the oval of Theorem 5.7.3 we have near z = 0: w = +b(—2v)!/4. Hence, 
the osculating oval has b = 2!/4, In order to bring the oval partly inside 0S 
we take b = 1. Near z = —2 we find in a similar way that on 0S we have 
w = +y {v + 2)/3, whereas with a = 2 on the ellipse of Theorem 5.7.4 we 
have w = +b/v +2. We choose b = 1/73. The resulting oval and ellipse 
are shown in Fig. 5.5. Application of Theorems 5.7.3 and 5.7.4 to the second 
order central (x = 1) and FOC schemes gives, defining d = ` da: 
a 


Theorem 5.8.5. (Stability of Adams-Bashforth-Euler scheme) 


a)For the second order central scheme a sufficient von Neumann stability 
condition is 


d<1 and 2e 4 Jda)? <1 and 22 c? fda < <tu+Vi-@). 


6 
b) For the fourth order central scheme a sufficient von Neumann stability 
condition is 


d<1 and 2t 2 fda) > <q and 22 fda < H(t VIT), 


— 50 
with 


15+ 710 
L Etus yi — 1)°/3 = 0.6360. 


Corresponding stability restrictions on the time step r follow from (5.61), 
(5.62), (5.65) and (5.66). 


q= 
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Adams-Bashforth-Crank-Nicolson 


This is an example of a mixed scheme of IMEX (implicit-explicit) type. The 
idea is to enlarge the stability domain by treating the diffusion terms implic- 
itly. In practical applications, the convection term is often nonlinear, which 
is an incentive to treat this term explicitly. Second order Adams-Bashforth 
is applied to convection and Crank-Nicolson to diffusion. The method has 
been applied to the Navier-Stokes equations in, for example, Braun, Fiebig, 
and Mitra (1994), Cantaloube and Lé (1992), Esposito (1992), Gavrilakis 
(1993), Grotzbach and Wörner (1994), Huser and Biringer (1992), Kim and 
Moin (1985), Moin and Kim (1982), Takemoto and Nakamura (1986), Take- 
moto and Nakamura (1989), Wörner and Grétzbach (1992), Zang, Street, 
and Koseff (1994). The method is given by 


3 1 1 
en OP = Cre! — ge") = 5 Daler? +e"). 


The stability polynomial is found to be 


3 1 1 
ô) +€(-1 + 57+ 58) -37 


1 
2 
l4- 
Olt 5 
For the same reasons as for the two preceding schemes we restrict ourselves 
to the case that y = iy2. The boundary 0S of the stability domain S is given 
by 


5 5 zY 50, z=v+iw, 


and is plotted in Fig. 5.6, together with the parabola of Theorem 5.7.5 with 
b = 2/3 and the oval of Theorem 5.7.3 with a = 1/2, b = (3/4)1/4. We have 


; 1 ; 3 1 
eH — 5°) —e'”(1 + -iw+ —v) + i 
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Fig. 5.6. Stability domain of Adams-Bashforth-Crank-Nicolson (—) with oval 
(- - -) and parabola (---) 
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{parabola U oval} C S. If we now allow y, # 0, —rL£,(8) shifts to the left 
in the complex plane, because according to (5.35) we have yı > 0. A look 
at Fig. 5.6 shows that when —rL,(0) moves left it stays inside S. Hence the 
restriction yı = 0 can be dropped, so that the following theorem holds for 
the x- and FOC schemes. Application of Theorems 5.7.3 and 5.7.5 results in 


Theorem 5.8.6. (Stability of Adams-Bashforth-Crank-Nicolson scheme) 
A sufficient condition for von Neumann stability of the Adams-Bashforth- 
Crank- Nicolson scheme is 


~ 3 
d < 1/2 and {Y e4 /da)"!? < aG or Soca < gv3} 
Q oO 


or 
` c? fda < 035 


with q1, q2, q3 as in Theorems 5.7.3 and 5.7.8. 


Corresponding time step restrictions follow from (5.61)-(5.63) and (5.69), 
(5.70). 


Extrapolated BDF 


This is an IMEX version of BDF, in which the discretization of convection is 
made explicit by estimating y”+? by extrapolation from y"t! and p”. The 
method is given by 


This method has been studied in Varah (1980) for the one-dimensional case. 
The stability polynomial is 


(14 56) + 5€(y- +5 


2 
3 Y. 


3 3 


Again, this being a mixed scheme, we restrict ourselves to the case that 
y = iy2. The boundary of the stability domain @S is given by 


, 2 4. 1 
Pim Eo) — Few + I) +5 + Sw, z=u+iw, 
so that on 0S 
pata cosp — 9 | yp PPH- cos H) 
2 4cosp — 2cos 2p 2 cos u — cos 2y 
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Fig. 5.7. Stability domain of extrapolated BDF (—) with oval (- - -) and parabola 


(o) 


When 4cos u ~ 2 cos 2u = 0, i.e. for p = po = arceos(} — $V3) = 1.95, z = 


v+iw goes to infinity. In Fig. 5.7 we plot 0S for 0 < u < 1.73. For u> po, 0S 
continues in the right half of the complex plane (not shown). The shape of 
OS is not unlike that for Adams-Bashforth-Crank-Nicolson, and to obtain 
stability conditions we again use the oval and the parabola. For |u| < 1 we 
have v = ~2w4 on OS, and the oval of Theorem 5.7.3 osculates with @S in 
z = 0 if b = (2a/3)!/4. In order to stay inside 0S we take b slightly smaller; 
with a = 1, b = (1/2)!/4 we obtain the oval shown in Fig. 5.7. The parabola 
shown has parameter b = 1. For the same reason as for the Adams-Bashforth- 
Crank-Nicolson scheme we can allow 7, 4 0. Application of Theorems 5.7.3 
and 5.7.5 results in: 


Theorem 5.8.7. (Stability of extrapolated BDF scheme) 
A sufficient condition for von Neumann stability of the ertrapolated BDF 
scheme is 


~ 1 1 
d < 1 and {P ek/da)"? < a5) or Soca < aa(5)"/7} 


or 


Sock /da < 93 ; 


with @1,92,93 as in Theorems 5.7.3 and 5.7.5. 


Corresponding time step restrictions follow from (5.61)-(5.63) and (5.69), 
(5.70). 
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Runge-Kutta 


Flow problems are often solved by a Runge-Kutta method. Applications 
to the Navier-Stokes equations are described in Meinke and Hanel (1990), 
Spalart, Moser, and Rogers (1991). As an example we take the method pro- 
posed by Wray as described in Spalart, Moser, and Rogers (1991), designed 
to save computer storage. This is a three-stage Runge-Kutta method, defined 


by 


gp = yg —Tlang” , 
yp = gp -TLl +a"), (5.80) 
prt — y* _ tLa(yse"* 4 Cop") 


with parameters 


_8 _ 5 -3 g=-" &=-2 
n= 75? 12> 7g? BO |g AT GQ oF 79° 


It suffices to store three instances of y. We have 


pt! = P(~rL,)y", Piz) =1l4+2+ T + L , (5.81) 


where P is called the amplification polynomial. The exact solution satisfies 
p(t +r) = e777 p(t). 


Since P(z) = e7 + O(z*), the temporal error per time step is O(r*), so that 
the overall temporal accuracy of Wray’s Runge-Kutta method is O(r°). 


Stability analysis of Runge-Kutta methods proceeds as follows. Taking the 
Fourier transform of (5.81) results in 


get! = P(—rin()) 9 - 


The quantity P(—rLa (r)) is called the amplification factor. For von Neumann 
stability it is sufficient that 


|P(—rL,(0))| <1, v8. (5.82) 
The stability domain § of Runge-Kutta methods is defined as 
S={zEC:|P(z)| <1}. 
We see that (5.82) is satisfied if (5.41) holds, as before. 


The stability domain of Runge-Kutta methods is easily determined as follows. 
One solves numerically for r the equation |P(—p-+ re'“)| = 1, for given p and 
u. Then the point —p + re’” belongs to ðS. By varying p between 0 and 
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2x, OS is traversed. The value p should be chosen such that the straight line 
z = —p-+re' intersects the upper half of ðS in only one point and does not 
come close to being tangent to 05, in order to make the equation for r easy 
to solve numerically; p = 2 is found to be suitable in the present case. The 
stability domain is shown in Fig. 5.8, together with the ellipse of Theorem 
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Fig. 5.8. Stability domain of Wray’s Runge-Kutta method (—) with ellipse (- - -) 
and rectangle (— - —-) 


5.7.4 (parameters: a = 2.5127, b = V3) and the rectangle of Theorem 5.7.1 
(parameters a = 1.65, b = 1.7). The value b = v3 is chosen because 0S 
intersects the imaginary axis in z = iV/3, since P(iV3) = -i + łiy3. Thus 
we obtain 


Theorem 5.8.8. (Stability of Wray’s Runge-Kutta method) 
A sufficient condition for von Neumann stability of Wray’s Runge-Kutta 
method is 


with a = 1.65, b = v3, or 


~ a 2d 243 -2 ~ 
d< and pz 2 calda < (2 = n) (1+ 4/1 —- 4d? /a?) 


with a = 2.5127, b= V3. 


Corresponding stability restrictions on the time step follow from (5.50), 
(5.52), (5.53) and (5.65)—(5.67). The advantage of using the rectangle is that 
the resulting stability conditions are simpler than for the ellipse. 
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Another Runge-Kutta method, especially designed for convection-diffusion 
problems, has been proposed in Sommeijer, van der Houwen, and Kok (1994). 
We will call it the SHK method. It is a four-stage method, and is given by 


p= p'-—aitlhhyy”, 


y** — p” _ arlag” , 
PPE” 1 
pr = g” — 3 Thane 
yprtt — p” _ rly 


where a; = 1/4, a2 = 1/3. As for Wray’s method, three instances of p have 
to be stored. The amplification polynomial is given by 


P(z)=1+4+2(14 zl +a2z(1+4 a 12))). 


Since P(z) = e? + O(z”), the temporal accuracy of the SHK Runge-Kutta 
method is O({r*). 


The stability domain is determined in the same way as before, and is pre- 
sented in Fig. 5.9. Because |P(iV8)| = 1, ðS intersects the imaginary 
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Fig. 5.9. Stability domain of SHK Runge-Kutta method (—) with ellipse (- - -) 
and rectangle (— - —-) 


axis in z = i/8. Also shown are the ellipse of Theorem 5.7.4 (parameters: 
a = 2.7853, b = 2.55) and the rectangle of Theorem 5.7.1 (parameters: 
a = 1, b = 2.55). Again, the rectangle is included because the resulting 
stability conditions are simpler than for the ellipse. Stability conditions are 
obtained from Theorem 5.8.8 and equations (5.50), (5.52), (5.53) and (5.65)- 
(5.67) by inserting the appropriate values of a and b. The extra stage makes 
the SHK Runge-Kutta method about 33% more time-consuming than Wray’s 
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Runge-Kutta method, but by comparing the values for b we see that if convec- 
tion dominates, so that 7L,,{0) is near the imaginary axis, the CFL numbers 
Ca and hence the time step 7 can be about 50% larger. 


One may also formulate implicit Runge-Kutta schemes to enhance stability. 
An application to the Navier-Stokes equations is presented in Marx (1994). 


Mixed Runge-Kutta methods 


The idea is to treat diffusion implicitly to enhance stability, and convection 
(which is often nonlinear) explicitly to decrease computational cost, and to 
have O(r?) temporal accuracy for diffusion and O(r3) for convection. En- 
hanced accuracy for convection is beneficial for convection dominated flows 
and for turbulence simulations, in which one wishes to have dissipation gov- 
erned by physics and not by numerical errors in approximating convection. 
Such schemes have been applied to the Navier-Stokes equations in Baron 
and Quadrio (1994), Le and Moin (1991), Huser and Biringer (1993), Joslin, 
Streett, and Chang (1993), Rai and Moin (1991), Spalart (1988), Yang and 
Ferziger (1993). Let, for example, Wray’s Runge-Kutta scheme be applied 
to convection, and the w-scheme (to be discussed below) to diffusion. The 
resulting method, used by Le and Moin (1991), is given by 


yt = yg? D, (&rp” + Bip”) — Chorp” , 
yp = yp" — Dilarp* + Bp) Chlor + Cay”) , (5.83) 
prt) = p** — Di (ase + Bapt!) — Ch (osy™* + Gy’) , 


with 
4 1 — 1l — 8 — 5 
ai = ĝi = 7g, 02 = b = 7g, 03 = fs = g, 01 = Gey C2 = FZ 5.84 
3, 1 izre B (5.84) 
3=5:Q=-— B=- i. 


We will call this the WLM Runge-Kutta method. Stability can be studied as 
follows. By taking the Fourier transformation of (5.83) one obtains 


yet = Ry, 4) ¥5 , 


writing for brevity, as before, y = C,(0), 6 = D,(6), with y and ô given by 
(5.36) and (5.37). The amplification factor R is given by 


R(y,6) = R3{RoRi — Coy/(1 + B2d)} — Cay Ri/(1 + 83d) , 
with 
R; = (1 —a,d — ony) /O + 3,4), k= 1,2,3 . 


This being a mixed scheme, we restrict ourselves to the case of purely imag- 
inary y = ty2, i.e. the central second or fourth order space discretization of 
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the convection term, so that R is a function of the complex number ô + 742. 
To find the stability domain we write 


R(z) Ra{ RoR + iCgw/(1 — Bov)} + iGgwRhi/(1 — G30) 5 
Rk = (1+ anv + io,w)/(1— Bkv), k=1,2,3. 


(5.85) 
The stability domain S is defined by 

S={zEC:{R(z)| <1}. 
The locus of |R(z)| = 1 is identical to the boundary 0S of S, and is easily 


determined numerically in the same way as for the preceding Runge-Kutta 
schemes. The result is shown in Fig. 5.10. Because with ô = 0 the scheme is 
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Fig. 5.10. Stability domain of WLM Runge-Kutta method. 


identical to Wray’s Runge-Kutta method, 0S intersects the imaginary axis 
in z = iV3. Therefore we have stability if |y2| < v3. Hence, we may apply 
Theorem 5.7.1 with a = œ, b = V3. For the same reason as for Adams- 
Bashforth-Crank-Nicolson we may allow yı Æ 0. We obtain: 


Theorem 5.8.9. (Stability of WLM Runge-Kutta method) 


A sufficient condition for von Neumann stability of the WLM Runge-Kutta 
method ts 





v3 
doa Sa: 


a 


The corresponding time step restriction is 


r < SSS llua]/ħa) 
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The fact that the temporal accuracy for convection is O(r?) follows from the 
observation that if there is no diffusion we have Wray’s Runge-Kutta scheme. 
On the other hand, if there is no convection, we have, taking w = 0 in (5.85), 


R(z) = Ry Rz R3 = e* + O(2?) , (5.86) 


so that the temporal accuracy of the approximation of diffusion is O(r). 


The w-scheme 


This method is given by 
pth — e” = -rL t! + (1—-w)y”), (5.87) 


where the parameter w remains to be chosen. The temporal accuracy is O(r), 
but for w = 1/24 O(r) it is O(7?) (cf. Exercise 5.3.1). For w = 0 we have the 
explicit Euler scheme discussed before. For w = 1/2 the scheme is called the 
Crank-Nicolson scheme and for w = 1 the implicit Euler scheme is obtained. 
The Crank-Nicolson scheme has been applied to the Navier-Stokes equations 
in, for example, Choi, Moin, and Kim (1993), Choi and Moin (1994). 


Stability can be analyzed for a particular value of w by the method used 
before. But it is easy to treat the case 1/2 < w < 1 as follows: 


Theorem 5.8.10. (Stability of w-scheme) 
For 1/2 < w < 1 the w-scheme is unconditionally von Neumann stable. 


Proof. By taking the Fourier transformation of (5.87) we obtain 


a A A 1-—(l-—w)z 

ni _ n — 

$a =R(rLpA(P)) 66, R(z) = l+wz 

We have 7 L (8) =v +iw, v = +y, w= y: with ô, y1 and yz given by 
(5.35)-(5.37). We must show that |R(v + iw)| < 1. We have 


1—(1—w)v)? + (1—w)?w? 

R w) = EUe) + —w)Pvt 

/R(v + iw)| (1+ wv)? + ww? 

Because v > 0 and 1/2 < w < 1 the denominator is not smaller than the 
numerator. o 


The case 0 < w < 1/2 is not of interest, because one has to pay the price of 
solving an implicit system, without the benefit of increased temporal accu- 
racy, as for w = 1/2 + O(r). The case w = 0 has been treated in Theorem 
5.8.1. The w = 1 scheme has certain advantages that will be discussed later, 
namely that it is strongly A-stable, and that the condition on 7 for the scheme 
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to be of positive type (Definition 5.4.1) is weaker than for w < 1. For example, 
with first order upwind discretization, (5.87) is of positive type if 


r < ——/ Di lnal/he + 20/02) ; (5.88) 


It is left to the reader to show this. 


Further remarks 


A large number of time discretization schemes has been presented, and the 
reader may wonder whether it would not have been better to confine our- 
selves to a smaller number of examples. However, almost all of these schemes 
are encountered in practice, usually without much analytic information on 
their stability. Such stability results as there are are scattered through the 
literature, and are frequently confined to one space dimension, one particular 
spatial discretization and/or absence of convection or diffusion terms, and 
are often more conservative than the conditions presented above. Therefore 
the preceding general and unifying stability analysis is useful. The reader 
may also wonder by which criteria a suitable method is to be selected from 
the many alternatives available. This will be our next subject. The present 
section is concluded with some further remarks on stability analysis. 


Von Neumann stability analysis for multistep methods involves deriving con- 
ditions for the roots of the stability polynomial to be in the unit disk. Schur- 
Cohn theory, as described in Miller (1971), gives necessary and sufficient 
conditions on the coefficients. However, deriving stability restrictions on the 
time step from these conditions is usually an arduous task, that has to be un- 
dertaken anew for each scheme that one wishes to consider, and gets rapidly 
out of hand as the order of the multistep method increases. Hence, it is not 
surprising that few results have been published. Furthermore, Schur-Cohn 
theory is not applicable to Runge-Kutta methods, for which not the absolute 
value of the roots but of the amplification polynomial is not to exceed 1. 
In the foregoing we have presented an alternative approach. Based on The- 
orems 5.7.1-5.7.5, stability conditions are easy to find. Our approach does 
not get more complicated as the order of multistep methods increases, and 
applies equally to Runge-Kutta methods, Although not its principle, its ease 
of use in practice is affected if the stability polynomial depends on more than 
two parameters, so that the classical stability diagram of the time discretiza- 
tion method does not apply. This may happen if the terms in the partial 
differential equation under consideration are not all discretized in time by 
the same scheme, i.e. with hybrid schemes. Nevertheless, for such schemes 
the method may still work, for example, when central space discretization is 
applied to first order terms, as we have seen. 
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The principle of the method is applicable to general initial-boundary value 
problems in any number of dimensions, but Theorems 5.7.1—-5.7.5 have been 
derived for the convection-diffusion equation, with the «-scheme or the fourth 
order central scheme used for space discretization of the convection term. To 
illustrate the use of the method, von Neumann stability conditions are de- 
rived for a number of schemes. In most cases, sufficient conditions seem not 
to have been available before. The present method of von Neumann stability 
analysis has been proposed in Wesseling (1996c), Wesseling (1995). 


Because Fourier transformation is used, von Neumann stability analysis is 
restricted to linear equations with constant coefficients, uniform grids and 
periodic boundary conditions. With different boundary conditions, von Neu- 
mann stability is necessary for explicit schemes, because it takes some time 
before the influence of (perhaps stabilizing) boundary conditions makes itself 
felt in the interior. Stability analysis in the presence of non-periodic bound- 
ary conditions is difficult. When the coefficients are variable we need to have 
local von Neumann stability with the local value of the coefficients, because 
usually instability is a local phenomenon, with the high wavenumber (short 
wavelength) Fourier modes being the most unstable. But it is shown in Wid- 
lund (1966) that this is not sufficient for stability in the variable coefficient 
case. For more background, see Thomeé (1990). Sometimes stability can be 
proved in the presence of variable coefficients, nonlinearity and non-periodic 
boundary conditions by means of the energy method, see Morton and Mayers 
(1994) Sect. 5.8. See Richtmyer and Morton (1967) for other methods of sta- 
bility analysis, and Morton and Mayers (1994) Sect. 5.9 for a survey of the 
current situation, including the influence of boundary conditions. 


Exercise 5.8.1. Consider the dimensionless one-dimensional convection-dif- 
fusion equation: 


Show that Theorem 5.8.2 results in (replacing \/1—4d?/a? by 0) the fol- 
lowing sufficient condition for von Neumann stability of the AB2 scheme in 
combination with the «-scheme: 


T < min[h?Pe/(4+ (2—2n)hPe), max{q1(2h?/Pe)/3, qah V2}, 


hy/1/3/(2 — «)] . 


Show that Theorem 5.8.1 gives for the explicit Euler scheme: 


T < min[Peh?/(2+ (1 —x«)APe), 2Pe71/(2 ~ )*]. 
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Show that for h = 1077, Pe = 1074 and x = 1/3 this results in 7 < 0.0035 for 
AB2 and r < 7.2* 107° for Euler. We see that for this convection dominated 
case the stability condition is much more restrictive for explicit Euler than 
for AB2. Note that with 7 = 0.0035 the CFL number is 0.35. 


Exercise 5.8.2. As Exercise 5.8.1, but for the FOC and first order upwind 
scheme. 


Exercise 5.8.3. This exercise is about saving storage for the AB2 scheme. 
Asume a three-dimensional domain with a cubic n; x ng x ng grid. With lex- 
icographic ordering, the matrix La is block-tridiagonal with (nin2) x (nina) 
blocks B;,D;,Cj, j = 1,...,n3. Let yt = y"t?, y? = p” and let y” 
and y? be two nyn2-dimensional scratch vectors. Let y? = (y!,...,y%,) be 
a partioning in n,n2-dimensional blocks, and similarly for yt. Show that 
Ln(3e"*1 — p") can be computed as follows, disregarding boundary modifi- 
cations: 


y? — 3y! — y? 
for j=l,...,n3 do 

yl =y? 

y? = Byy? + Djy! + Cyyf ai 
end 


Show how "°t? can be computed without using additional storage. 


Exercise 5.8.4. Show that the parameters of the half ellipse of Theo- 
rem 5.7.4 that osculates with 0S of the AB3 scheme in z = —6/11 are 
a = 6/11, b= 72 2/235. (Hint: develop the stability polynomial with 
€ = e'# in a power series around u = 7; cf. our treatment of the AB2 scheme). 


Exercise 5.8.5. As Exercise 5.8.1, but for AB3 scheme. Show that a suffi- 
cient stability condition is: r < 0.0026. 


Exercise 5.8.6. Show that the leapfrog scheme applied to the convection- 
diffusion equation is unstable for every r > 0. 


Exercise 5.8.7. Show that the temporal accuracy of the third order Adams- 
Bashforth scheme is O(7°). 


Exercise 5.8.8. Show that (5.78) implies (5.77). 
Exercise 5.8.9. Prove (5.86). 
Exercise 5.8.10. Prove (5.88). 
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5.9 Numerical experiments 
Test problem 


In order to motivate the introduction, besides their stability conditions, of 
additional criteria to judge the merits of discretization schemes, some nu- 
merical experiments will be presented for the following one-dimensional test 
problem: 


ðp = ay Pp 
at dn ee 
q(t, £) = B? D cos B(x — ut) , 


1 t<T 
O<ar<l, 0<t<T, (5.89) 


with D and u > 0 constant, and 6 a parameter. An exact solution is given 


by 
p(t, z) = cos B(x — ut) + e7% Dt cosa(z — ut) , (5.90) 


with a arbitrary. Spatial discretization is done with the second order central 
(« = 1) or with the first order upwind scheme on a uniform cell-centered 
grid with mesh size A. For temporal discretization the w-scheme is used. The 
resulting scheme can be written as 


yer — yp; + wLrgr + (1l-w)Lay} = 74; (5.91) 


with, in the interior, 


1 . 
Lay; = 57 (Pi+1 ~ pj-1) + d(—pj-1 + 29; — Pit), JG = 2,3,..,73-1, 
(5.92) 


where o = ur/h, d = Dr/h*. The grid is a uniform version of the grid 
depicted in Figure 4.1. 


Choice of time step and mesh size and time scale 


The length scale £ and time scale 7 of the exact solution are given by 
£L=n/max(a, 8), T =min{L/u,(Da*)~"}, 


where we take for the length scale of a harmonic function half its wavelength. 
We may expect accuracy to be sufficient if r< T, h < L. For efficiency, we 
would like to avoid more stringent restrictions on 7 and A, such as might arise 
from stability or the positive type property. In the numerical experiments to 
be described we take a = 47, B = 2m. We take mostly h = 1/30, giving 
h/£ = 0.18. 
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We note that the discrete operator is of positive type (Definition 5.4.1) if, in 
the case of the second order central scheme, 


h2 
and p= [ih <2, (5.93) 


L o 
"30 —o)D, 


whereas in the case of the first order upwind scheme we must have 


2 





r< A /(2D + |ulh) . (5.94) 


— w 
The first conditions of (5.93) and (5.94) arise from the temporal discretization 
(and is different for other discretizations). If their violation causes spurious 
wiggles (the risk is greatest if the solution is nonsmooth), then 7 should be 
decreased, or something should be done about the temporal discretization. 
This will lead us to the concept of strong stability in the next section. The 
second condition of (5.93) is the same as in the stationary case. How the 
situation may be handled is described in Chap. 4. For example, one may use 
the x-scheme with limiting. 


Dirichlet-Neumann boundary conditions 


First we prescribe a Dirichlet condition at z = 0 and a Neumann condition at 
z = 1. Initial and boundary conditions are chosen conforming with the exact 
solution (5.90). With the second order central scheme, we have a discrete 
operator of positive type for w = 1 if the mesh Péclet number p = uh/D 
satisfies p < 2. The left half of Fig. 5.11 shows a result. In this case we have 
T = 0.23, 7 = 0.157, p = 1.83. The accuracy is disappointing. The cause is 
that with w = 1 the scheme is only first order in time, Making 7 four times 
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Fig. 5.11. Exact (-—) and numerical solution (o) of (5.89). Second order central 
discretization; w = 1, u = 1.1, D = 0.02, h = 7 = 1/30, t=1, a= 4r, B = 2r. 
Left: Dirichlet-Neumann boundary conditions; right: periodic boundary condition. 
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Fig. 5.12. As the preceding figure, but with 7 = 1/120. 


as small gives the more accurate result of Fig. 5.12. Almost the same results 
(not shown) are obtained with the w = 1/2 scheme, but with r much larger, 
namely r = 1/30. Hence, the w = 1/2 scheme is more efficient than the w = 1 
scheme in this case. We have r < h?/{2(1 —w)D} = 1/18 and p = 1.83, so 
that according to (5.93) the scheme is of positive type. The CFL number 
c = |ulr/h and diffusion number d = 2Dr/h? have values c? = 1.21 and 
d = 1.2, so that the w = 0 scheme is instable according to Theorem 5.8.1. 


We continue with w = 1/2. In the preceding cases the discretization is of 
positive type, so that the maximum principle (Theorem 5.4.1)) holds, and 
spurious wiggles are excluded. In Fig. 5.13, where we have decreased D and 
where p = 7.3, the scheme is not of positive type, and we find spurious wiggles 
near z = 1. However, £ and 7 are the same as before, so we would like to 
keep 7 and A the same for efficiency. Just as in the stationary case discussed 
in Chap. 4, spurious wiggles may be avoided by local mesh refinement near 





Fig. 5.13. As the left part of Fig. 5.11, but with w = 1/2 and D = 0.005. 
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r = 1, or by applying a homogeneous Neumann condition at z = 1, which 
gives the result presented in Fig. 5.14. Note that with the ‘wrong’ homoge- 
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Fig. 5.14. As Fig. 5.11, but with a homogeneous Neumann condition at x = 1 for 
the left figure, and w = 1/2, D = 0.005. 


neous Neumann condition the accuracy is much better than when g(t, 1)/Oz 
is derived from the exact solution. The reason is that an artificial numerical 
boundary layer at z = 1 is avoided, and that the upstream influence of the 
‘wrong’ outflow boundary condition is restricted to a strip of width O(D), as 
explained in Sect. 2.5. 
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Fig. 5.15. Exact (—) and numerical (0) solution of equation (5.89). Second order 
central scheme; w = 1/2, u = 1.1, D = 0.0005, h = 7 = 1/30, t = 0.8333. Left: 
homogeneous Neumann outflow condition; right: periodic boundary conditions. 
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Transparent boundary condition 


When D is decreased further, spurious wiggles gradually show up again, as 
seen in Fig. 5.15. They are generated at the boundary x = 1. The reason 
is that with D small we are close to solving a wave equation, and spurious 
reflection occurs at the Neumann boundary. In solving wave problems it is 
frequently the case that a large domain is cut off by an artificial boundary, to 
make the problem numerically tractable. Care has to be taken that outgoing 
waves travel unhampered through the artificial boundary, and do not reflect. 
To this end a transparent boundary condition (or weakly reflecting, or non- 
reflecting, or absorbing) has to be applied. Principles for deriving transparent 
boundary conditions are set forth in Engquist and Majda (1979). We will not 
discuss these principles, but use the results. Transparent. boundary conditions 
for the convection-diffusion equation are studied in Hall (1986). There the 
following second order accurate transparent boundary condition is derived 


for (5.89): 
oP yuck =0 at z=1. (5.95) 


This boundary condition is implemented numerically in the following way. 
We discretize (5.95) with the interior scheme (5.91), (5.92) in the boundary 
cell 7 = J, using a virtual point J + 1: 


O 
> + z, est — ys~1) = 0. (5.96) 


Next, the spatial discretization of the differential equation is extended to the 
cell 27 using the virtual point J + 1: 


d o d 
a + glp = pIi) + = (-Ps-1 + 2y pI) =g. (5.97) 


2T 
Elimination of the virtual value y 741 results in the following boundary 
scheme: 


dp; c _ 6 
at -=(P — pJ-1) = PL (5.98) 


This will be used as a transparent boundary condition. 


Fig. 5.16 shows a result, which is much more satisfactory than that of 
Fig. 5.15. No boundary reflection is discernible. This shows the importance of 
using transparent outflow conditions for nonstationary convection dominated 
problems. 


If diffusion dominates, the transparent outflow condition (5.98) gives wrong 
results. This is easily seen by taking ø = 0, in which case (5.97) determines 
vy’, completely, thus invoking a wrong Dirichlet boundary condition. But in 
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Fig. 5.16. Exact (—) and numerical solution (o) of 5.11. Second order central 
discretization; w = 1/2, u = 1.1, D = 0.0005,hk = 7 = 1/30,¢ = 0.8333, transparent 
outflow condition. 


a reasonable mathematical model, if diffusion dominates, a physical bound- 
ary condition should be available at an outflow boundary. So there should 
be no need for an artificial outflow condition. In any case, the transparent 
outflow condition (5.98) should be selected only if the mesh Péclet num- 
ber p = uh/D = o/d is large enough, for example p > 2; otherwise, if no 
physical information is available, a homogeneous Neumann condition is to be 
preferred. 


Periodic boundary conditions 


Accuracy is enhanced in the preceding examples by the circumstance that 
at £z = 0 the correct value is continuously fed in by the Dirichlet boundary 
condition. With a periodic boundary condition y(t, 1) = p(t, 0) this is not the 
case, and the influence of discretization error is greater. The exact solution 
(5.90) is a combination of a damped and undamped traveling wave, and the 
numerical error may be broken down in dissipation (wrong amplitude) and 
dispersion (wrong propagation velocity). These concepts will be discussed 
more fully in Chap. 8. 


Results with the periodic boundary condition are presented in the right parts 
of the preceding figures. In Fig. 5.11 we see that there is too much damp- 
ing. This is due to the temporal discretization with w = 1; with r smaller 
(Fig. 5.12) or with w = 1/2 (not shown) the numerical amplitude is more 
or less correct. Figs. 5.14 and 5.15 show that with the periodic boundary 
condition the scheme is not prone to spurious wiggles. The figures also show 
that the error is mainly due to the numerical propagation velocity being too 
small (dispersion). 
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Nonsmooth solution 


In the following example the exact solution is not smooth. Again we consider 
(5.89), but with g = 0. The initial and boundary conditions are 


y(0,2) = -10<2<1/2, y(0,2)=1, 1/2<2<1; 
y(t,0) = 1, 2k-1/2< ut < 2k + 1/2; (5.99) 
p(t,0) = —1, 2k+1/2< ut < 2k +3/2; 


Op(t,1)/dx = 0 


for k integer, so that in the absence of diffusion the solution is a periodic 
step-function propagating with velocity u. We have not. determined the exact 
solution for D > 0. 


First the diffusion equation is considered (u = 0). Fig. 5.17 shows a result. 
When 7 is increased beyond 0.01 the w = 0 scheme becomes unstable. With 











Fig. 5.17. Numerical solution (o) of equations (5.89), (5.99) with q = u = 0; 
—: initial condition; w = 0, D = 1/50, k = 1/50,7 = 1/110, ¢=3. 


w = l and 7 = 1/15 almost the same result (not shown) is obtained as in 
Fig. 5.17. With w = 1/2 better temporal accuracy is obtained, but we have 
an operator of positive type only if r < h?/D (cf. (5.93)), ie. d < 2. If this 
condition is not met numerical wiggles may occur. These indeed show up for 
smal] time, as seen in Fig. 5.18, where d = 6.7. Since here the convection 
term is absent, the temporal discretization is to blame. Improvement of the 
temporal discretization will be discussed in the next section. 


Next, the convection-diffusion equation is considered. The transparent bound- 
ary condition is applied at z = 1. Although we do not know the exact solution, 
we can say that qualitatively the exact solution is a periodic step-function 
that is smoothed as time increases and propagates with velocity u. When 
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Fig. 5.18. As the preceding figure, but with w = 1/2, T = 1/15, t=0.2 


the discretization is of positive type there are no spurious wiggles, but with 
the present nonsmooth initial condition wiggles show up immediately when 
this not the case, as in Fig. 5.19, where the mesh Péclet number p = 9 > 2. 
Switching to first order upwind discretization restores monotonicity, but the 








Fig. 5.19. Problem and symbols as in Fig. 5.17. Second order central discretization; 
w = 1/2, D = 0.002, h=7=1/50, t = 0.4, u = 0.9. 


result shows much false diffusion, cf. Fig. 5.20. A correct solution is shown 
in Fig. 5.21, obtained with p = 1.8. But this is an inefficient computation, 
because 7 and h are very small compared to the physical scales 7 and £. 


The following conclusions may be drawn. If the second order central scheme 
is of positive type, good results are obtained. But conditions (5.93) are very 
restrictive. An excess of numerical damping can be caused by the time dis- 
cretization (w = 1) or by first order upwind spatial discretization. Better tem- 
poral accuracy is obtained with w = 1/2, but if (5.93) is violated, numerical 
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Fig. 5.20. As the preceding figure, but with first order upwind discretization 





Fig. 5.21. As the preceding figure, but with 7 = 0.008 and A = 0.004 


wiggles may occur if the exact solution is not smooth. In wave propagation 
problems the amplitude and the propagation velocity may be inaccurate due 
to spatial and/or temporal discretization errors. 


To sum up, both temporal and spatial discretization need improvement rela- 
tive to the methods discussed in this section. In the following, methods will be 
discussed that have gained acceptance in contemporary computational fluid 
dynamics. 


Exercise 5.9.1. Carry out computational experiments for the problems for 
which the figures in this section give results. Experiment with defect correc- 
tion. 
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5.10 Strong stability 


As mentioned before, for efficiency the step sizes 7 and hg, should be governed 
only by the physical scales 7 and £ and preferably not by stability. This may 
be achieved by choosing methods whose stability domains satisfy certain cri- 
teria. These may depend on the application, for instance, on whether diffusion 
dominates or convection. A method is called A-stable if its stability domain $ 
includes the second and third quadrant. Then the method is unconditionally 
stable for the convection-diffusion equation. An interesting theorem about 
A-stable schemes is Dahiquist’s second order barrier theorem (see Hairer and 
Wanner (1991)), which says that the time accuracy of A-stable schemes is at 
best O(r*). If the stability domain S includes the negative real axis, we will 
call the scheme R-stable. The advantage is that stability is not restricted by 
the diffusion term, because this term contributes only to the real part of the 
symbol of the scheme. As a consequence, instead of a stability condition of 
type T = O(h?) one has 7 = O(h). 


We will now formulate an additional requirement for A- or R-stable schemes. 
When this requirement is satisfied, spurious wiggles due to non-smoothness 
of the exact solution are damped. Such wiggles are seen in Fig. 5.18. 


Strong stability 


Let q = 0 in (5.89). By postulating a solution of the form y(t, £) = y(t)e!??/", 
the Fourier transform of (5.89) is obtained as 


dy/dt = dy, A= —D(6/h)? —iud/h. (5.100) 


For our present purpose it is not necessary to discretize in space. We discretize 
(5.100) in time. Taking the w = 1/2 scheme as an example one obtains 


n n 1 nm n 
ytty = 5Arly +t). 


The stability polynomial is (1 — $Ar)€ — 1 — $Ar, with root 


1 
2 


— 1+år/2 
& = 1- Ar/2 (5.101) 


The numerical solution is given by 
y= ty’. 


For temporal discretization schemes in general we have 


K 
y” =Y Okek. 
k=1 
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with Ck constants and &,...,E« the roots of the stability polynomial. The 
exact solution is given by 


y(nr) = e™Ty(0) . 


We now make the following observation. We have with À given by (5.100): 
Je"7| = e274" with d= 2Dr/h*. We see that Fourier components with 8 
large get damped quickly. However, for the w = 1/2 scheme this is not the 
case. On the contrary, if |@| — oo we have A + —oo and according to (5.101) 
€,; —> —1. This explains the wiggles in Fig. 5.18. Of course, there A is different 
due to spatial discretization, and is given by, since u = 0, 


Ar = —rL,,(0) = —2d sin? 6/2 , 


where (5.33) has been used. The Fourier mode with the shortest wavelength 
that can be represented on the grid has 0 = ~r. In Fig. 5.18 we have d = 20/3, 
so that for 0 = 7 one finds £; = —17/23 = —0.74. This explains why spurious 
wiggles occur during the first time steps in the case of Fig. 5.18. These may be 
diminished or avoided altogether by requiring that the numerical scheme, like 
the differential equation, strongly damps short waves if diffusion is dominant. 
This leads to the following definition. 


Definition 5.10.1. Strong stability 

A discretization method is called strongly A-stable or strongly R-stable if it 
is A-stable or R-stable, respectively, and if when applied to dy/dt = Ay the 
roots €1,...,€« of its stability polynomial satisfy 


lim |&| <1, k=1,..K. 
à=- 900 


Of course, since the amplification factor e*7 of the exact solution satisfies 
lim-o |e*”| = 0, the smaller limy_,_ oo êk], the better. 


It is not difficult determine which of the A-stable or R-stable schemes dis- 
cussed in Sect. 5.8 are strongly A-stable or R-stable. The stability polynomial 
of the BDF scheme was found in Sect. 5.8 to be (1— 2 2\g? — teg t with in the 
notation of the present section z = TÀ. As z => ~œ the roots satisfy E12 > 0, 
so that BDF is strongly A-stable. Extrapolated BDF is strongly R-stable, to 
show this is left to the reader. From (5.101) we see that the w = 1/2 scheme 
is not strongly A-stable. The remaining R-stable schemes of Sect. 5.8 treat 
the diffusion term in a similar way as the w = 1/2 scheme, and turn out to 
be likewise not strongly R-stable. To show this for Adams-Bashforth-Crank- 
Nicolson is left to the reader. Although for Runge-Kutta schemes stability 
was studied by means ofan amplification factor rather than the stability poly- 
nomial, the stability polynomial is readily defined, so that Definition 5.10.1 
is easily extended to Runge-Kutta schemes. Application of Runge-Kutta to 
(5.100) gives 
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yr th = R(rA)yn , 


with R the amplification factor introduced in Sect. 5.8. Hence the stability 
polynomial is € — R, with root €; = R. For the WLM Runge-Kutta method 
R(z) is given by (5.85) with z = v + iw = —d — iy. We have lim;_,.. R(z) = 
—1, so that the method is not strongly R-stable. 


The multistage w-scheme 


The w-scheme is strongly A-stable if w > 1/2. But this reduces the tem- 
poral accuracy from O(7?) to O(r). By using the w-scheme in multistage 
fashion, O(r?) accurate and strongly A-stable or R-stable schemes may be 
obtained, at similar computational cost as the w = 1/2 scheme. Such meth- 
ods have been proposed and analyzed in Cash (1984), Gourlay and Morris 
(1980), Gourlay and Morris (1981), Lawson and Morris (1978), Glowinski 
and Périaux (1987), Bristeau, Glowinski, and Périaux (1987). The usefulness 
of these schemes for solving the Navier-Stokes equations is shown in Glowin- 
ski and Périaux (1987), Bristeau, Glowinski, and Périaux (1987), Rannacher 
(1989), Randall (1994), Turek (1994). A time step consists of three stages. 
Applied to dy/dt = ày +q the multistage w-scheme is defined by, writing 
z= TA, 


(l-—awz)y*t” = (1+ a wz)y” +wrgq”, 
(1 — aw z)yttt-# = (1+ aw zjy t” +w rg tHe, (5.102) 
(1— awz)y"*? = (1+ a we yr ti-e 4 rg tl 
w =1- 2w, a =1l-a, 


where the parameters a and w remain to be chosen. The three stages have 
time steps wr, wr and wT, respectively. Each stage is in fact an w-scheme, so 
that the cost of a time step is three times the cost of the w-scheme. However, 
this is compensated by the fact that r can be chosen larger for the same 
accuracy. The amplification factor is 


_ (+ a wz)?(1 + aw z) 


g(z) = (1 awz)?(1 — a'w" z) . (5.103) 


For accuracy, g(z) must approximate e” for small z. We have 
1 
g(z) =1l4+z4+ 57 {1 + (1 — 2a)(2w? — 4w + 1)} + O(z3). 


Second order accuracy is obtained if g(z) = e? + O(z3). This is the case if 
either a = 1/2, which brings us back to the Crank-Nicolson scheme, or 


w = 1 — 1/v2 = 0.292, (5.104) 
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which is used in the multistage w-scheme. If we choose a such that aw =a'w , 
j.e. 


a = (1 — 2w)/(1 — w) = 2 — V2 = 0.585, (5.105) 


then the coefficients in the left sides of (5.102) are the same, and hence also 
the matrices of the systems to be solved when applying the multistage w- 
scheme to (5.89). We have the following theorem. 


Theorem 5.10.1. Strong A-stability 
If 
: <a<l 
9 — 
then the multistage w-scheme is strongly A-stable. 


Proof. The amplification factor g(z) given by (5.103) has poles only on the 
positive real axis, at z = 1/aw and z = 1/(1—a)w . Furthermore, 


Jim, Ig) za /a<l. (5.106) 


Hence, g(z) is analytic for Re(z) < 0. On the imaginary axis, writing 
a = ca, z= ty, tawy = iv, we have 


(1 + c?v7)?(1 + 2v?) 

(1 + v?)?(1 4 2c?v?) 

(ct — 1)v* + 2(c* ~ c?)v® 
(1 + v?)2(1 + 2c?v?) 

From (5.105) it follows that c = a/(1 — a) < 1, so that 


lg(éy)|? 


lg(iy)? <1. (5.107) 


Consider the closed contour consisting of the half-circle z = re’®, 7/2 <0 < 
37/2 and the segment of the imaginary axis z = iy, —r < y < r. Inside this 
contour g({z) is analytic, and according to the maximum modulus principle, 
inside the contour |g(z)| is bounded by its maximum on the contour. Letting 
r — oo, we conclude from (5.106) and (5.107) that g|(z)| < 1 for Re(z) < 0, 
which shows A-stability. Furthermore, according to (5.106), 


lim |g(z)| =a /a<1, 


z= - OO 
so that we have strong A-stability. o 


The choice a = 1 (i.e. a’ = 0) seems attractive, because then 


lim |g(2)|=0, 


ing CO 
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as for the exact amplification factor e* of the differential equation. Then 
transient wiggles as those in Fig. 5.18 will surely be absent. But one would 
also like to approximate the exact amplification factor in another respect, 
namely undamped propagation of waves: 


lg(iy)} S [ge(¢y)| = 1. 
With a = 1 one finds 
0.9 < |g{iy)| <1 for 0<|yj/<3, (5.108) 
whereas with a = (1 — 2w)/(1 — w) one has 
0.9 < |g{iy)| <1 for 0<|y/ <6, (5.109) 
which is more attractive, For this value of a, 
\g(—00)| = 1/V2 = 0.702..., 


so that rough transients will be damped, but it will take a few time steps. 


Application to the convection-diffusion equation 


Application of the multistage w-scheme to the spatial discretization of (5.89) 
gives 


pete - p} + awt Lage t” + awl? = WT} , 

ynti-w _ pire 4 sorlet + aw Lng? t” = (5.110) 
w rgt” , 

ntl _ pt 1—w + awtLyp"t} ta wrL,yti-# — wrq”t+! , 


where L, is defined by (5.28) and (5.29). The amplification factor is given by 
g(8) =9(z), z= -r La (0) ' 


with g(z) given by (5.103) and r£a(@) by (5.34). Since Re({~rÈa(0)) < 0 
and the multistage w-schema is A-stable, the scheme is unconditionally zero- 
stable and absolutely stable. 


Let us now see whether the multistage w-scheme is more attractive than 
the w-scheme. Application to the test case of Fig. 5.18 gives the results of 
Fig. 5.22. In order to keep the cost of the multistage w-scheme the same as 
for the w = 1/2 scheme, for Fig. 5.22 7 is three times as large as for Fig. 5.18. 
Although some wiggles occur in Fig. 5.22, they are smaller than in Fig. 5.18. 
If d = 0, u # 0 we have undamped wave propagation; therefore we would 
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Fig. 5.22. Problem and symbols as in Fig. 5.18. Multistage w-scheme; second 
order central discretization; w = 1 — 1/ v2, a = 2 — V?, D = 0.02; u = 0; 
h = 1/50, r= 1/5, t = 0.2. 


like to have |g(8)| = 1 in this case. With d = 0,7L,(0) = ic; sin@. From 


(5.109) it follows that 0.9 < |g(icı sin@)| < 1 for 0 < cı < 6 for all @ (for the 
w = 1/2 scheme, |g(tc; sin @)| = 1 for all cı and 8, which is better, of course). 
The multistage w-scheme gives results that are virtually indistinguishable 
from those of Figs. 5.14-5.18 with 7 three times as large as for the w = 1/2 
scheme. Figure 5.23 gives results for almost undamped wave propagation for 
long time. Here 


q=0, (5.111) 
and the exact solution is given by 
p = exp(~D6*t) cos p(z — ut), B= 37, (5.112) 


with corresponding initial condition, Dirichlet condition at x = 0 and ho- 
mogeneous Neumann condition at z = 1. Numerical dissipation is seen to be 
negligible, because the amplitude of the wave is faithfully represented. For 
the exact solution (5.112) the time and length scales are 7 = 1/3u = 0.3 and 
L = 1/3, so that 7 and h are not unreasonably small, whereas the accuracy 
is reasonable. The wiggles are due to the central discretization of the con- 
vection term. Note that the mesh Péclet and CFL numbers satisfy p = 73.3 
and cı = 3.3. The monotonicity of the solution will be improved later in this 
section. 


We may conclude that the multistage w-scheme is more attractive than the 
Crank-Nicolson (w = 1/2) scheme for temporal discretization. 
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Fig. 5.23. Exact (—) and numerical (0) solution of (5.89), (5.111). Multistage w- 
scheme, second order central discretization; w = 1 — 1/2, œ = 2? — V2, D = 
0.0005, u = 1.1, k = 1/30; 7 = 0.1; t = 10. 


Choice of temporal discretization method 


Because we have treated the w-scheme and its multistage version rather ex- 
tensively, the reader may be led to think that this is the method of choice. 
But the situation is not so clearcut. There are many possibilities for temporal 
discretization. For practical applications methods are mainly chosen on the 
basis of the following considerations. 


(i) Low artificial diffusion due to discretization of the convection term. This 
is desirable if the temporal behavior of time-dependent flow is to be com- 
puted accurately. Physical oscillations should not be damped prematurely, 
and instable flows should not be stabilized artificially. This is an important 
consideration in large-eddy or direct simulation of turbulence. Since the con- 
vection term in the differential equation causes no dissipation and its symbol 
is purely imaginary (cf. (5.100)), it is desirable that the symbol of the spatial 
discretization of the convection term (given by (5.34) for the «-scheme), is 
close to the imaginary axis. As far as the temporal discretization is concerned, 
we can say the following. For the differential equation the amplification fac- 
tor for one time step is exp(AT) with A given by (5.100). In the absence of 
diffusion (D = 0) its modulus is 1. With D = 0 the symbol of the spatial 
discretization should be close to the imaginary axis, as argued above, and fur- 
thermore, the modulus of the amplification factor (for single step schemes) 
or of the dominant root of the stability polynomial (for multistep schemes) 
should be close to 1. That means that the boundary ðS of the stability do- 
main should be close to a sizeable segment of the imaginary axis. This rules 
out, for example, the use of the explicit and implicit (w = 1) Euler schemes 
for convection. 
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(ii) Accuracy. Consideration (i) leads automatically to the use of O(r7) 
accurate methods or better for convection. If diffusion is important (Péclet 
number not large) the diffusion term must also be approximated with O(r7) 
accuracy. If diffusion is small (large Péclet number), O(7) accuracy may suf- 
fice for the diffusion term. 


(iii) Stability and efficiency. Explicit schemes generally lead to stability 
conditions of type T = O(h) for the convection term and r = O(h?) for the 
diffusion term. If the latter condition is too severe, for example due to locally 
small A in realistic applications, then the diffusion term must be discretized 
implicitly, at additional cost per time step; this may be balanced by the fact 
that r can be increased. As mentioned at the beginning of Sect. 5.9, for ef- 
ficiency it is desirable that + and h are constrained not by stability but by 
accuracy, i.e. related to 7 and £, the time and length scales of the exact 
solution. If a stability condition of type T = O(h) is not too severe, then con- 
vection may be treated explicitly, using an IMEX scheme. Because in practice 
convection terms are usually nonlinear, this decreases the computing cost of 
implicit schemes. If spurious wiggles occur due to nonsmoothness of the so- 
lution, then a strongly stable scheme should be used, or in the explicit case, 
r should be decreased in order to move away from 0S (per definition, on 0S 
we have no damping). If spurious wiggles are due to the spatial discretiza- 
tion (i.e. they persist when steady state is approached), then the measures 
discussed in Chap. 4 should be taken. In multidimensional problems with 
thin boundary layers, in which the spatial mesh size in a particular coordi- 
nate direction is much smaller than in the other directions, it may pay off to 
treat this direction implicitly and the other directions explicitly. In hyperbolic 
and convection domained problems, not only should numerical dissipation be 
small, but also dispersion, i.e. numerical wave propagation velocities should 
be close to the physical ones. We will come back to this in Chap. 8. 


Defect correction 


Defect correction is a way to improve the spatial discretization. For example, 
to improve the accuracy of the first order upwind scheme, while keeping spuri- 
ous wiggles small, defect correction may be applied, just as in the stationary 
case (Sect. 4.8). As an example, consider one-step temporal discretization 
(such as the multistage w-scheme). With first order upwind or second order 
central spatial discretization, this leads to systems of the following type: 


Bap t! + Aug” = gh, (5.113) 
Bep"** + Aep = e, (5.114) 
where the subscript u refers to upwind and c to central discretization. Defect 


correction for time-dependent problems consists of the following steps, for 
each time step: 


5.10. Strong stability 225 


(i) Solve (5.113) and call the result pl). 


(ii) For k = 1,..., K: 

By dy") =q? — Bop“) — Ag”, pl) = pl) tipt; (5.115) 
(iii) prt? =p). 
Step (ii) represents K defect corrections. If step (ii) is repeated until conver- 
gence so that dp) — 0, (5.114) has been solved (in an expensive manner). 
But a middle course between (5.113) (low accuracy) and (5.114) (risk of spu- 


rious wiggles) is steered by taking K = 1. As shown in Sect. 4.6, this gives 
second order accuracy. 


Applying defect correction with K = 1 to each successive stage of the mul- 
tistage w-scheme to the test case of Fig. 5.23 results in Fig. 5.24. A marked 
improvement is obtained compared to Fig. 5.23. One might think that the 
price for this is doubling of computing time, because now we have to solve 
twice per stage with matrix B,, versus once with Be without defect correc- 
tion. But in realistic computational fluid dynamics problem solving with Bu 
{an M-matrix) is much faster than solving with Be; more on this in Chap. 7. 


Defect correction seems a good way to improve upon the accuracy of a 
first order scheme of positive type, without incurring too much of the non- 
monotonicity that may pollute results of non-positive second order schemes. 
In hyperbolic or convection dominated problems with nonsmooth solutions, 
the second order central scheme still introduces too much non-monotonicity. 
Then defect correction should be applied to the combination of the first order 
upwind and a flux-limited «-scheme. 








Fig. 5.24. Same case as Fig. 5.23, but with one defect correction based on first 
order upwind and second order central discretization 
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Exercise 5.10.1. Repeat the numerical experiments of this section. Does 
the accuracy improve when the number of defect corrections is increased? 


Exercise 5.10.2. Choose a = 1 in the multistage w-scheme and recompute 
Figs. 5.22 and 5.23. Explain the differences that you observe. 


Exercise 5.10.3. Verify Theorem 5.10.1 and equations (5.108), (5.109) by 
making a contour plot of |g(z)| (using a contour plotting program from the 
website www.netlib.org or in MATLAB). 


Exercise 5.10.4. Show that the extrapolated BDF scheme is strongly R-stable, 
and that the Adams-Bashforth-Crank-Nicolson scheme is not. 


Exercise 5.10.5. Formulate defect correction for the second order Adams- 
Bashforth and BDF schemes. 


6. The incompressible Navier-Stokes equations 


6.1 Introduction 


In this chapter, the incompressible Navier-Stokes equations in Cartesian co- 
ordinates discretized on uniform grids will be considered, discussing most of 
the basic numerical principles in a simple setting. In practical applications, of 
course, nonuniform grids and general coordinate systems are prevalent; these 
will be studied in later chapters, as will be the compressible case. 


We can be relatively brief in discussing discretization of the Navier-Stokes 
equations, because we prepared the ground in our extensive discussion of the 
convection-diffusion equation in Chapters 3, 4 and 5. Therefore it wil not be 
necessary to discuss again the various possibilities for discretizing convection, 
or von Neumann stability conditions. 


Only the primitive variable formulation will be discussed. This means that 
the velocity components and the pressure will be used as unknowns. This is 
because using other unknowns, such as the streamfunction and the vorticity, 
may be advantageous mainly in special circumstances, such as in two dimen- 
sions, or when vortices or vortex layers play a paramount role. 


Other introductory texts on the subject are Ferziger and Perić (1996), 
Fletcher (1988), Peyret and Taylor (1985), Shyy (1994). 


6.2 Equations of motion and boundary conditions 


Equations of motion 


The equations of motion have been discussed in Chap. 1. For ease of reference, 
the equations to be considered are repeated here. In this section the number 
of dimensions is three. We assume incompressible flow, i.e. Dp/Dt = 0, so 
that 


uy =O. (6.1) 


Conditions for incompressibility have been discussed in Sect. 1.12. We allow 
p to be variable, so that stratified flow or flow with large heat transfer can 
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be taken into account. 
The dimensionless incompressible Navier-Stokes equations are given by, in 
Cartesian tensor notation, 


Opu® 
Ot 
where f is a body force. The following units are chosen: velocity: U; length: 
L; density: po; pressure: pol/*. Then the Reynolds number is given by 


Re = poUL/p , 





+ (pu%u®) = —P a + a + f%, ge = Re~ u% + uf) , (6.2) 


with u the dynamic viscosity coefficient. 


An example of a body force occurs in buoyant flow, where the dimensionless 
body force is given by (see Sect. 1.14) 


f° =—pq*RaT . 


Here g is the acceleration of gravity, T is temperature and the Rayleigh 
number is defined by 


Ra = polg|bATL/(poU”) , 


with AT, used as unit of temperature, a measure of the temperature variation, 
and 6 the thermal expansion coefficient of the fluid. Note that the Grashoff 
number of Sect. 1.14 is replaced by the Rayleigh number, because we use a 
different velocity unit. 


The temperature is governed by equation (1.55). In dimensionless form: 


OpT 
“Fr + (PUT) a = (Pea) a +p , (6.3) 


where we have neglected the frictional heating term, and the Péclet number 
is given by 


Pe = UL/(cpk}. 


Usually 4: and k are functions of T, but cp is constant. Also p = p(T), but the 
above equations also cover situations where p depends on the concentration 
of a solute, also designated by T. If AT is of moderate size, p may be taken 
constant, and for laminar flow the same holds for pz and k. But if a turbulence 
model is applied, or T varies strongly, j: and k vary. 


Equation (6.3) is a convection-diffusion equation, and (6.2) is very similar. 
Hence, we will be able to profit a great deal from the insights gained in 
the preceding chapters concerning the numerical aspects of the convection- 
diffusion equation. The only, but significant, new complication is the lack 
of a time-evolution equation for the pressure p and the appearance of the 
kinematic constraint (6.1). 
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Boundary conditions 


Both equations (6.2) and (6.3) are parabolic, and initial and boundary con- 
ditions are to be prescibed accordingly, cf. Chap. 2. 


For the convection-diffusion equation (6.3) an initial condition 
T(0,æ) =To(#), 2 EN (6.4) 


is required, and on every point of the boundary 02 of the domain 2 exactly 
one of the following conditions must be given: 


T(t,æ) = f(t,æ) (Dirichlet), (6.5) 
OT(t,x)/On = f(t,æ) (Neumann), (6.6) 
T(t, æ) +aðT(t,æ)/ðn = f(t,æ) (Robin), (6.7) 


with a > 0 and n the outward normal. If in a physical model nothing is 
known at an outflow boundary, it is advisable to prescribe a homogeneous 
Neumann condition, as argued in Sect. 4.2. If a singularity at t = 0 is to be 
avoided, the initial temperature field To(x) in (6.4) should satisfy (6.5)-(6.7) 
at t = 0 at the corresponding parts of the boundary. 


For the momentum equations (6.2) the following initial condition is required: 
u°(0,2) = w° (æ), 


with the prescribed initial velocity field w satisfying the continuity equation 


(6.1): 


wy = 0. 
Note that there is no initial condition for the pressure, since 0p/0t does not 
occur. 


No-slip condition 


Viscous fluids cling to solid surfaces. This is called the no-slip condition. For 
the physical background of this condition, see Batchelor (1967). Hence, at a 
solid surface we have 


u“ (t,x) = v(t, x) , (6.8) 


with v%(¢, æ) the local wall velocity. The Dirichlet condition (6.8) holds also 
at open parts of the boundary where the velocity is prescribed, which may 
be the case at an inflow boundary. But at an inflow boundary one may also 
prescribe condition (6.9) given below. 
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Free surface conditions 


At a free surface the tangential stress components are zero. If the local (nor- 
mal) displacement velocity of the surface v(t, æ) is known we have the fol- 
lowing conditions: 


un” = v(t,#), nos? = 0, noht =0, (6.9) 


with n the unit outer normal, and s,t linearly independent unit tangent 
vectors. One may also use (6.9) at an inflow boundary where the normal 
velocity is known. Conditions (6.9) may also arise at a plane of symmetry, in 
which case v = 0. 


One can also have a free surface with unknown position and displacement 
velocity. Then the normal stress must also be prescribed, and (6.9) is replaced 
by 


p—n% an? = po, no PsP = 0, not = 0. (6.10) 


In special cases one may wish to prescribe non-zero tangential stress in (6.9) 
or (6.10), for example, when one wishes to take the influence of wind shear 
on a water surface into account. 


Outflow conditions 


At an outflow boundary, often not enough physical information is available 
on which to base a sufficient number of boundary conditions. Usually only the 
pressure is known. This is not as serious as it may seem, because when Re > 1 
‘wrong’ information generated by an artificial boundary condition propagates 
upstream only over a distance of O(Re~*). This is plausible because of the 
resemblance of (6.2) to the convection-diffusion equation, and may in fact be 
shown directly by applying singular perturbation analysis to (6.2) in a similar 
manner as in Sect. 2.5. In order to avoid spurious numerical wiggles it is 
advisable to choose for artificial outflow conditions the Neumann conditions 
(6.10) in the stationary case. In the nonstationary case we expect this to be 
somewhat unsatisfactory, based on our experience with the one-dimensional 
convection-diffusion equation in Sect. 5.9, where we found the transparent 
boundary condition to be better. However, the question of what are good and 
easy to apply transparent boundary conditions for the incompressible Navier- 
Stokes equations seems to be largely unsettled, cf. the review presented in 
Sani and Gresho (1994). Outflow conditions can have a significant influence 
on results of instationary flow computations, as illustrated in Jin and Braza 
(1993). In Sect. 6.5 we will consider a generalization of the transparent outflow 
condition derived for the convection-diffusion equation in Sect. 5.9 to the 
incompressible Navier-Stokes equations, discretized on a staggered grid. 
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Compatibility condition 


At every part of the boundary exactly one of the boundary conditions (6.8), 
(6.9) or (6.10) needs to be prescribed. If it is the case that along the whole of 
the boundary 092 the normal velocity v(t, æ) is prescribed, then it follows from 
(6.1) and the divergence theorem that the following compatibility condition 
must be satisfied: 


f ve) =0. (6.11) 


on 


It is shown in Temam (1985) that in order for (6.1), (6.2) to be well-posed for 
t > 0 the prescribed initial velocity field w(#) and the prescribed boundary 
velocity field must be related by 


* = Va (0, x)n® 


Wo (x)n 
on parts of 092 where the normal velocity is given. But the tangential compo- 
nents of the initial velocity need not be the same. Therefore, for example, a 
sliding wall may be set in motion instantaneously at ¢ = 0 in a fluid originally 
at rest, but one should not let the speed of a body or of an inlet flow change 
discontinuously. 


Theory 


Much work has been done on the existence and uniqueness of (weak) solutions 
of the incompressible Navier-Stokes equations. For a survey, see Feistauer 
(1993) and Lions (1996); an introduction to the subject is given in Temam 
(1985). With certain restrictions on the boundary conditions, proofs have 
been given for the existence of solutions. If the Reynolds number is small 
enough the solution is unique, otherwise non-uniqueness may occur. This need 
not preclude numerical computation, because the possible solutions are often 
well separated. Some or all of them may be unstable. When the nonstationary 
equations are solved, one of the solutions is found, depending on the initial 
conditions. Interesting examples of computation of non-unique solutions are 
given in Roux (1990). 


Exercise 6.2.1. Convince yourself that the Grashoff number measures the 
ratio of buoyancy and viscous forces, and that the Rayleigh number measures 
the ration of buoyancy and inertia. 


232 6. The incompressible Navier-Stokes equations 


6.3 Spatial discretization on colocated grid 
Choice of grid 


We restrict ourselves here to two dimensions. Generalization to three dimen- 
sions is straightforward. The domain (2 is a rectangle of size Ly x Lz. A 
cell-centered grid is used, as in Sect. 3.5. The domain 2 is subdivided in 
rectangular cells of size hy x hz. The computational grid G is the set of 
cell-centers: 


G= {z EN: g= Ti, j= (ji; j2); ja = 1, 2, ..., Me, Ma = Lafha} ; 
with æ; defined by 


æj = (27,07), 2} = (jı -—1/2)hy, z2 = (j2 — 1/2)h2 . 


All discrete unknowns reside in the cell centers. This is called a colocated! 
grid. The cell with center at æ; is called 2;. We define 


ei = (1/2, 0), ez = (0, 1/2) ` 


The value of a quantity p in æ; is designated by 9;; Pje, is located at a 
cell face, namely at 


Eje, = (Jihı, (J2 — 1/2)h2) . 
The cell at the ‘east’ side of 2; is designated by 25426, - 


If one wishes, a vertex-centered grid with nodes on the boundary 02 may 
be used instead. As shown in Chap. 3, this makes litte difference: despite 
differences in local error at boundaries, the global error will be of the same 
order. 


Straightforward discretization of the continuity equation 


Finite volume discretization of (6.1) gives 


[ a0 = hyu! fte 4 hyu? Pte? = 0, (6.12) 


j-e1 je 
N, 


The cell face values in (6.12) need to be interpolated in terms of cell center 
values. The straightforward way to do this is to use linear interpolation, 
resulting in 


hautaa + Mu hta = 0. (6-13) 


! Merriam-Webster dictionary: colocate: to locate together 
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Discretization of the momentum equation 


Taking p = 1 and f = 0 in (6.2), we have to discretize 


ðu” 
ôt 


Finite volume discretization gives 





+ FIP + pq =0, F° = uu -0% | (6.14) 


ea + Fe + P,a fd =hyhaduf /dt + ho F* ites 
É (6.15) 


+ hy Fe? te? + hpi tee — 9 


j-€a -e 


with y # a. The cell face values have to be interpolated between cell center 
values. This is quite straightforward. For the pressure this is done as follows: 


1 
Pi+ea = (Pi + Pitre.) - (6.16) 


The viscous stress 0% is approximated usin 
Pp g 


(Re“uS)jte, = Rere, (USpze, — UF)/hi » 


j+e1 

(6.17) 

1 , , 

-1 a o -1 2e: 2e1+2 
(Re us )iter = Repe (U Gize + U hzer zea)/h2 » 
etcetera. For the inertia terms we may use 

a,, p ow 1 a, p 1 a,, p 

(u%u” ites = z u \j +50 u” )j42eg » (6.18) 


corresponding to second order central discretization. 


The first order upwind scheme is obtained with (abandoning the summation 
convention for the remainder of this section) 


(u uP )jpes & (u%uP);, ub tub soe, 20; (6.19) 
(uu? )jrep S (uu? )s pac, UË +u oe <0. 


The hybrid scheme easily follows from (6.18) and (6.19) using (4.36). For 
better accuracy one may use the «-scheme (4.89) or the limited «-scheme 
(4.94), which becomes in the present case, using (4.92), 


(u*uP) ite, = Fg), p= ur, 


; (6.20) 
Pj = (Pj — Pj-2eg)/(Pj+2e8 — Pj-2e—) » 


where we assume u? -+ Us zes > 0. In the contrary case we obtain using 


(4.95): 
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(uw) ites = S(Pjsres)s =(P- Pistes) /(Pi —Yj44en)» (6-21) 


For the x-scheme without limiting, according to (4.93), f is given by 





2-K. l+k 


f(g) = z P+ 


Of course, if no limiting is used, for efficiency in programming one should not 
introduce the normalized variable ¢, but work with u® and uô directly. For 
limiting, various possibilities for f(¢@) have been discussed in Sect. 4.8. 


Frequently, for the purpose of iterative solution, the nonlinear term u%u® is 
linearized, for example by u®u? & u®a? where @ is known from a previous 
iteration or time level, or, with Newton’s method, as u@u? = au + urae — 
2u°u?. It is a bad idea to use uĉuf = uf, because it is u? that should 
be updated in the u°-equation and not uf; divergence results. With these 


linearizations, the following form of the «-scheme is convenient: 


8 ~ p 8 
(u%u?) = Uste,d (US), Ujtes = 5 (uj + ub soe.) 3 (6 22) 
~a — a a a a ° 
uj; = (us - UF 2es)/(US42e5 -= UF 265) ’ 


assuming US es > 0; the contrary case easily follows from (6.21). 


Discretization of convection-diffusion equation 


In a similar way, finite volume discretization of equation (6.3) gives 


hyhgdTj/dt + ho(ul T) PTS + hy (wT) Te 


j= ei j—ez 
-ha (Pe~ Ta) (245! — hy (Pe™ tTa) [2422 = hyhag; . 


j—eı j-e2 


To approximate this further in terms of cell center values one may write 
(Pe~ T a)jtea = Perie, (Li+2e — Tj) {ha . 


Approximation of the convective flux (u°T);4., is quite analogous to the 
approximation of the inertia flux as given before. For example, the limited 
k-scheme is obtained with (cf. (6.21)) 


(UST) j4e, = F(¥;), g=uT, (6.23) 


assuming uj + u? ze, > 0, or alternatively (cf. (6.22)) 


~ ~ 1 
(U T)jpea = Use, (Ti), Uste, = z + Uj, 2,) ' 
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Spurious checkerboard modes 


The preceding straightforward spatial discretization method is completely 
analogous to what was done for the convection-diffusion equation in preced- 
ing chapters. But now a difficulty not encountered before crops up, that has 
to do with the lack of a time-evolution equation for the pressure p and the oc- 
currence of the continuity equation (6.1). Assume Re = constant, and neglect 
boundary conditions. We will show that 


uj = (—1)31+229(2), p= (—1)9142 (6.24) 


is a solution of (6.15)-(6.18) for suitable g(t). Equation (6.13) is trivially 
satisfied. The contributions of p and u%u? in (6.15) obviously cancel. We 
have 


2 42 
Resti, = Zul, 
g+e1 hı j 
Reg? ~ 1 2jj+2e1 1 ( Liter 4 yl pit ier tien) 
jter 7 hy j t Thy U |j- 2e2 j4+2e1—2eg 


With u? given by (6.24), we find 





aljjter — EMI r ayiti g(t) . 
hF hte hyRe | 1) g( ) 
Similarly, 
12h; BEN 
h Fe jte2 _ —1)J14+32 g(t). 
1 jes hpRe | ) g(t) 
Substitution in (6.15) gives 
dg -2 -212 
z thi + hy )Re9 = 9 


so that (6.24) is a solution with 


g(t) = exp{— (hr? +hz7)t}. 


This shows that if Re >> 1 the spurious checkerboard mode (6.24) is damped 
slowly for the velocity, but not at all for the pressure. In practice this spuri- 
ous mode is found to occur to such an extent as to make the above spatial 
discretization useless. 


One-sided discretization of divu and gradp 


The spurious checkerboard mode is made possible because the discretization 
of divu = u% in (6.13) and of gradp in (6.15), (6.16) straddles the central cell 
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§2;. This may be avoided by using forward differences for Ua,a and backward 
differences for p a, replacing (6.13) by 


1)j+2e 2\jt+2e2 _ 
hau | 1! + hyu lj 7=0, 


and by approximating P a in æ; by 


or the other way around. A proof of convergence is given in Ellison, Hall, and 
Porsching (1987). The method is applied in Fuchs and Zhao (1984), who find 
that the accuracy is O(A, + h2). 


Flux splitting 


In Dick (1988), Dick (1989), Dick and Linden (1992) a fiux splitting method 
is proposed for the incompressible Navier-Stokes equations. This method is 
similar to the flux splitting scheme of Roe for the compressible Euler equa- 
tions, to be discussed in Sect. 10.3. This introduces a combination of forward 
and backward one-sided differences in a more sophisticated way than the pre- 
ceding method, ruling out spurious modes as well, but at higher computing 
cost. 


Pressure-weighted interpolation method 


A colocated discretization method of accuracy O(h? + h2) has been proposed 
by Rhie and Chow (1983). The idea is to prevent checkerboard oscillations by 
perturbing the continuity equation with pressure terms. Discretization of the 
momentum equations is as before. In Rhie and Chow (1983) the description 
of the discretization is intertwined with the description of an iterative solu- 
tion method, and it is not easy to see which discretization is actually solved. 
Furthermore, depending on the choice for the iterative solution method, the 
final result may depend on certain relaxation factors, which would be unde- 
sirable. This is not the case for the version proposed in Miller and Schmidt 
(1988), where also the discretization that is solved is given. The continuity 
equation is discretized as in (6.12), with the cell face velocity components 
evaluated as follows: 


1 h i +2e : 
Use, = 5 (uy + Uj,2¢,) + (Ga AD)? ° (no summation), (6.25) 


where A°p; = Pj42ea — 2p; + Pj-2,, B # a and aş equals the negative sum 
of the coefficients of ug, k Æ j in equation (6.15). This way of expressing 


6.3. Spatial discretization on colocated grid 237 


Us;e, in terms of cell center values is often called pressure- weighted interpola- 
tion, and the resulting colocated discretization is called the pressure-weighted 
interpolation (PWI) method. 


The reasoning behing (6.25), see Rhie and Chow (1983), Miller and Schmidt 
(1988), is intuitive and somewhat involved. The basic mathematical principle 
is that a small regularizing term is added that excludes spurious modes. We 
have A%p = O(h2), so that, assuming a? is smooth, the pressure term in 
(6.25) is O(h2hg). The error in linear interpolation between u% 
is O(h2,), so that the regularizing pressure term in (6.25) does not affect the 
order of accuracy of the discretization error for the continuity equation. The 
main idea behind (6.25) is to use the momentum equations in an approxi- 
mate way to interpolate cell face velocity components, in order to bring in 
the pressure. However, if there is a significant body force present, (6.25) does 
not work, and should be modified to bring in the body force, see Gu (1991). 


Q 
and uge, 


Substitution of (6.25) in (6.12) results in the following discretization of 
divu = 0 with the PWI method: 


ljj+2e: 2jj+2e2 
hou lze, + hiu lize, 


1 1 1 

+ h{(s74'P) i420 - (4P); + (514 P)j-2e1} (6.26) 
1 1 1 

+ hi {zz P)j+2e — (z4 P); + (zaz A"P) 5-202) = 0. 


Because in discretized form divu = 0 only in the sense of (6.26), for the 
computation of mass flux through a cell face u#,,. as given by (6.25) should 
be used in convection terms. This means that (6.23) and (6.18) must be 
replaced by 


1 
(u°T) j4e. = ai +e (Ty+2e, + T;) 


and 


a | fans l a a 
(u u )jpea = 5 Mi pea (Ui 4200 + uj) (6.27) 


for second order central discretization. For other discretizations of convection, 
such as those discussed in Chapters 3 and 4 (first order upwind, hybrid or 
a (limited) «-scheme), Tj,-, and u?,,_ (in 6.15) are approximated by the 
appropriate formulae. This has been described in Sect. 4.8 and does not need 
to be discussed further. 


Boundary conditions 


Boundary conditions for u; and T are implemented in the same way as for 
cell-centered discretization of the convection-diffusion equation, as described 
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in Chapt. 3. The reader is reminded of the fact that, as seen in Chapt. 3, loss 
of order of accuracy in local error at boundaries does not cause the global 
error to deteriorate, so that, in spite of suggestions in the literature to the 
contrary, there is no need to introduce seemingly more accurate and more 
complicated discretizations at boundaries, or to prefer vertex-centered grids. 


The PWI method requires some further specifications of conditions at bound- 
aries, beyond what is given for the differential equations, which is a disad- 
vantage of PWI. Let (j = 1, j2), so that the ‘west’ face of 2; is part of the 
boundary. In (6.15) (after using (6.16)), pj_2e, occurs, referring to a grid 
point outside the grid G. This value is approximated by extrapolation from 
the interior (Perić, Kessler, and Scheuerer (1988), Rhie and Chow (1983)): 


Pj—2e, = 2Pj — Pj +20, - (6.28) 


This is artificial, since the differential equations are not accompanied by a 
boundary condition for the pressure. 


If the normal (outward) velocity v(#, t) at the ‘west’ face of 2; is prescribed, 
then one uses 


ule, = —U(®je,, t) (6.29) 
instead of (6.25) in (6.12). According to (6.25), in uj,,, the value pj-2e, 
occurs, This is eliminated with (6.28). As a result, no pressure values outside 
G occur in (6.26). Of course, (6.29) is also used in (6.25) and (6.27) for normal 
cell face velocities in #;_¢,. If the normal velocity is prescribed over the whole 
boundary, then adding the discrete continuity equation (6.26) for each and 
every cell and substituting (6.29) at the boundaries results in 


ho{Lgvg — Swow} + hi{ Envy — Lsvs} =0, (6.30) 


where Xp indicates summation over all cell faces at the ‘east’ side of the 
boundary of £2, etc. This is the discrete version of the compatibility relation 
(6.11). The boundary data must be specified such that (6.30) is satisfied 
within rounding error. 


We found in the literature no discussion of the implementation of the free 
surface conditions (6.10) for the PWI scheme. Of course, in (6.15), pje, — 
(Re~*o!);_., is replaced by paw, and (Re~*o?!);_., and (Re™to!?)j e, are 
replaced by zero, so that no unknowns outside G occur in (6.15). In (6.26), in 
the expression (6.25) for uj,,, we eliminate p;_2-, with (6.28). It remains to 
choose u;_,,. This has to be extrapolated from the interior. Two possibilities 
arise: 
1 1 1 $4 tła 


Use, = Us, Use, = 9 — 9 Ui+2e1 . 


We will not go into this further. 
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Often, the pressure distribution at boundaries needs to be computed. How- 
ever, after the computation with the PWI scheme has been completed, the 
pressure is available only at cell centers. If required, the pressure at a bound- 
ary may be obtained by extrapolation from the interior (Perić, Kessler, and 
Scheuerer (1988), Rhie and Chow (1983)), for example by 


3 1 
Pj-e, = Pi 7 5 Pi+2e. . 


Summary of equations 


After spatial discretization, a system of ordinary differential equations in 
time is obtained. Putting all unknown velocity components is some order in 
an algebraic vector u and all pressure unknowns in an algebraic vector p, and 
dividing (6.15) and (6.26) by hih2, these equations can be written as 


= + Nu) + Gp= f(t); 
Du + Bp = g(t). 


Here N is a nonlinear algebraic operator arising from the discretization of the 
inertia and viscous terms, G is a linear algebraic operator representing the 
discretization of the pressure gradient, D and B are linear algebraic operators 
corresponding to the velocity terms and the pressure terms, respectively, in 
the discretization (6.26) of the continuity equation, and f and g are known 
source terms arising from the boundary conditions and body forces. The 
system contains no time derivative for p, and contains both differential and 
algebraic equations. Therefore it is called a differential-algebraic system. 


(6.31) 


This completes our description of spatial discretization with the PWI method 
of Rhie and Chow (1983). Temporal discretization will be discussed later. A 
theory of convergence does not seem to be available. Numerical tests and 
comparisons with other discretization methods are given in Drikakis and 
Schafer (1994), Kobayashi and Pereira (1991), Miller and Schmidt (1988), 
Perić, Kessler, and Scheuerer (1988). The accuracy seems to be the same as 
for the staggered discretization to be discussed next, and is O(h? + h3). 


Due to the relative ease with which colocated discretization can be extended 
to non-orthogonal grids, use of the PWI method has become widespread. Sta- 
tionary flow computations are presented in Acharya and Moukalled (1989), 
Burns, Jones, Kightley, and Wilkes (1987), Deng (1989), Gu (1991), Kobayashi 
and Pereira (1991), Lapworth (1988), Lien and Leschziner (1994), Miller and 
Schmidt (1988), Moukalled and Acharya (1991), Perić, Kessler, and Scheuerer 
(1988), Rhie and Chow (1983), Rodi, Majumdar, and Schénung (1989); insta- 
tionary flows are treated with the PWI method in Armfield (1994), Demirdžić 
and Perić (1990), Ho and Lakshminarayana (1993), Zang, Street, and Koseff 
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(1994). But if domain-decomposition is used (for greater geometric flexibil- 
ity, or for parallel computing), then the coupling conditions at subdomain 
boundaries are more complicated than for staggered discretization, due to 
the artificial pressure terms. 


Artificial compressibility method 


Instead of using PWI to suppress the spurious modes, one may introduce an 
artificial compressibility term in the continuity equation. This has been pro- 
posed in Chorin (1967), and applied in Belov, Martinelli, and Jameson (1994), 
Chang and Kwak (1984), Drikakis and Schafer (1994), Hartwich and Hsu 
(1988), Kwak and Chang (1984), Kwak, Chang, Shanks, and Chakravarthy 
(1986), Rogers and Kwak (1990), Rogers, Kwak, and Kiris (1991), Sheng, 
Taylor, and Whitfield (1995), Soh and Goodrich (1988), Tamamidis, Zhang, 
and Assanis (1996). For an introduction, see Peyret and Taylor (1985). The 
continuity equation (6.1) is replaced by the following artificial equation: 


ðp gy 
Ba +u% =0, (6.32) 
with 8 a parameter to be chosen. When steady state is reached, (6.32) is 
equivalent with the original continuity equation (6.1), but the time depen- 
dence of the solution is falsified. This is a major reason why use of the artificial 
compressibility method is less widespread than the PWI method. However, 
time-dependence can be approximated by introducing a pseudo-time variable, 
see Merkle and Tsai (1986), Merkle and Athvale (1987), Soh and Goodrich 
(1988), Rogers and Kwak (1990), Rogers, Kwak, and Kiris (1991). Both for 
brevity and because it is less widely used than PWI, the artificial compress- 
ibility method will not be discussed further. In a comparative study of the 
PWI and artificial compressibility methods by Tamamidis, Zhang, and As- 
sanis (1996), the PWI method is found to give better accuracy and mass 
conservation, and to require less memory. 


6.4 Spatial discretization on staggered grid 


This is the oldest and most straightforward approach to discretizing the 
Navier-Stokes equations. The method was first proposed in Harlow and Welch 
(1965), and is described in detail in Patankar (1980). On orthogonal grids it 
remains the method of choice. No artificial boundary conditions are necessary; 
the physical boundary conditions suffice. No special measures are required to 
prevent spurious modes. Theoretical convergence analysis is more developed 
than for the colocated scheme. For theoretical background, see Hou and Wet- 
ton (1993), Zhang and Kwok (1997) and references quoted there. 
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Staggered grid 


Grid points for different unknowns are staggered with respect to each other. 
The pressure resides in the cell centers, whereas the cell face centers contain 
the normal velocity components, cf. Fig. 6.1. The same system of grid point 


Fig. 6.1. Staggered placement of unknowns; —, t: velocity components; è: pressure. 


indices is used as in the preceding section. Hence, æ; is a pressure point 
a’ _ : : . 1 
containing p; and @j42,, a = 1,2 are velocity points containing u;,,, or 


2 . 
Uste, respectively. 


Note that with a staggered grid we always have a mixture of vertex-centered 
and cell-centered discretization. Unavoidably, at a boundary, some unknowns 
will have nodes upon it, whereas other unknowns have no nodes on this 
boundary, but half a mesh size removed. Therefore it is fortunate, as seen in 
Chapt. 3, that vertex-centered and cell-centered discretization are on equal 
footing as far as global accuracy and ease of implementation of boundary 
conditions are concerned. It might be convenient to put a boundary where 
the normal stress is prescribed (first condition of (6.10)), through pressure 
points, but there is no real need to do so. 


Discretization of continuity equation 


As before, the continuity equation is integrated over §2;, resulting in (cf. (6.12)) 


haw BTS + hurt = 0. (6.33) 


ey] jJ—€2 


The advantage of the staggered placement of the unknowns is that no further 
approximation is necessary in this equation. 
Discretization of momentum equations 


Finite volume integration takes place over control volumes centered at u® grid 
points. For example, for a = 1 the momentum equation (6.14) is integrated 
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over 92;4¢, (cf. Fig. 6.2). The treatment for a = 2 is quite similar, and will 





Fig. 6.2. Control volume for u}4.,. 


not be given. Similarly to (6.15) one obtains 


dul + ze: 
J {ar + Fg’ t+ pr}d2 Shy hoduj,.,/dt + hF" HY 


Sitel 


(6.34) 


12jj+e1+ +2 
+ hy Pe Gpe e +h =0. 
Now p occurs only in its own nodal points; interpolation is not necessary 
for p and spurious modes cannot occur. This is the advantage of staggered 
over colocated discretization. But a number of other quantities have to be 
interpolated between grid nodes. The viscous stress otf is approximated using 
-1 l\ æ% -1 1j 
(Re uw); = (hi Re;) ly arn 5 


(Re~'u’y);4e, 42 = (h2Rejperpe.) we eter ’ 
~ ma - i +2 
(Re lu? )jtertez = (hiRejtei+ez) lyt tiete ‘ 


With the (limited) x-scheme (4.94), the inertia terms are discretized as follows 
(cf. (6.20)): 


(u'u'); Sf(Gj;-e,), guulul, (6.35) 
Pj-e, =(Pj-e, — Pj-3e1)/(Pj+e1 — Pj-3e,) 


1 


if u; 


e, + Uibe, > 0, whereas in the contrary case 
(u'u) = f(Pjter)s Pizer = (Pizer — Pjt3e1)/(Pj-e1 — P4301), (6-36) 
or alternatively, in analogy with (6.22) 
(utu’)j = uj f(tij-e,), uj = (uie, + Us+e,) 4 (6.37) 
assuming u} > 0; the contrary case easily follows from (6.36). The second 


type of inertia term that has to be approximated in (6.34) is (pu!u?); 40,45 
Similarly to (6.35), we write 
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(uu) j4ertes = Fl Pj+er)s p = ulu? 4 (6.38) 
Pite: = (Lite, = Pj+ei—2e2)/(Pjtei+2ez = Pj+e,—2e2) ; 
assuming 
1 
2 — 2 2 
Ujiteite: 7 5 (Utes + Uj 420,49) > 0. (6.39) 


The contrary case is left to the reader. In the order to evaluate pj+e, ete. 

n (6.38), us,,, etc. has to be interpolated, which is done as in (6.39). This 
brings i in quite a few grid points in the discretization, which is avoided by 
following (6.37) instead of using (6.38): 


1,,2 m p2 ~ł 
(u u Jj+ei+ez = Uses pez) (Ujre,) 3 


assuming Ute, tez (given by (6.39)) to be positive; of course, % is defined 


similar to ~j4e, in (6.38). 


Üj pe, 


Boundary conditions 


On the staggered grid the implementation of the boundary conditions (6.4)- 
(6.10) is just as simple and done in the same way as for the convection- 
diffusion equation in Chapters 3 and 4. It is noteworthy that, unlike the colo- 
cated case, the boundary conditions that accompany the differential equations 
suffice to complete the discretization, and that no artificial extra boundary 
conditions for the pressure are required. This is easily seen as follows. Let 

= (1, jz); so that the ‘west’ face of 92; is at the boundary. If the normal 
velocity i is given, we know ul e, SO that equation (6.34) is not used, and no 
reference to pressure or velocity nodes outside the domain is made. If the 
normal velocity is not prescribed, then, as in Sect. 3.4 for the convection- 
diffusion equation on a vertex-centered grid, the finite volume integration 
(6. 34) i is carried out over half a cell. For example, for j = (1, j2), the equation 
for uj_., (which is located at the boundary x’ = 0) is obtained by integration 
over §2;_¢,, which is the rectangle with vertices at #;_¢,+¢, and #+.¢,. This 
gives (cf. (6.34)) 


J (ee + FY + pa}dQ = Sh jhodus_, /dt 
Dye, (6.40) 


1 ey tes 
shi FUE re te? + haphe, = 0. 


121 
+ he F i ey t 2 J37 e1 E2 


Since the normal velocity is not prescribed, the normal stress p + o1!! = pæ 
is prescribed, according to the boundary conditions (6.10). This means that 
we can substitute 


Fixe, + Pje, = (utut); -e1 7 Cine + Pj-e, = (uru*)j—e, + Poo 
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in equation (6.40). For F}. 4e, we need to approximate (ulu?—o!?); e; pez. 
We write 
1 ola 1 1 
Uj=eitez — 9 (jer + U;_e,42¢>) . (6.4 ) 


If u? is given at the boundary under consideration, the prescribed value is 
substituted directly in u!u*. Furthermore, in this case one has to approximate 


ol? for which we need to approximate ut, + u*. This can be done as 


J— eitez? 
follows: 


1 
l 1 
hy j-e1+2ea = Uje) ’ 


IIe 


(u's) j-ertes 


2 
(u)j-ertes -= p, (ites = U$ estes) ’ 


where Us eitez is given by the boundary condition. If u? is not given, then 
the tangential stress is prescribed, according to the boundary conditions 
(6.9), (6.10). We substitute o}? = f in F!?, and it remains to determine 
(u'u?);—~e,4e,- For ut we use (6.41), and u? is approximated with a one-sided 


: 2 
bias by UŞ per 


Summary of equations 


Putting all unknown velocity components in some order in an algebraic vector 
u and all pressure unknowns in an algebraic vector p, and dividing by hihz, 
these equations go over in a differential-algebraic system of the following 
structure: 


—+N(u)+Gp= f(t),  Du=g(t). (6.42) 


Here N is a nonlinear algebraic operator arising from the discretization of 
the inertia and viscous terms, G is a linear algebraic operator representing 
the discretization of the pressure gradient, D is a linear algebraic operator 
representing the discretization of the continuity equation, and f and g are 
known source terms, arising from the boundary conditions. 


This completes our description of spatial discretization on the staggered grid. 


6.5 On the choice of boundary conditions 


Having completed our discussion of the principles of spatial discretization of 
the incompressible Navier-Stokes equations on Cartesian grids, we will now 
present further conditions on the choice of boundary conditions. This choice 
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is not always clear-cut. At solid boundaries only one possibility presents itself, 
namely the no-slip condition, but at inflow and outflow boundaries choices 
may have to be made that influence physical realism and numerical accuracy. 
Some guidelines will be given by discussing two illustrative examples. 


Flow over a backward facing step 
Out first example is the flow over a backward facing step (Fig. 6.3). This 
is a frequently used test problem for testing numerical methods (Morgan, 


Périaux, and Thomasset (1984), Gartling (1990), Leone Jr. (1990)). The grid 


is not Cartesian, but this is irrelevant for what we want to discuss in this 


C 





Fig. 6.3. Boundary-fitted grid and streamlines for flow over a backward facing 
step. Outflow conditions: -p +0 = —po, 0°? =0. 


section. In Chapt. 13 discretization on non-orthogonal structured grids will 
be considered. The results presented here were obtained using a staggered 
scheme. 
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The geometry of the domain is defined by (cf. Fig. 6.3): [AB] = {AF| = 
IEF| = 1/2, IBC] = 3,/CD} = 1,{ED| = 2.5. At AB, which is an inflow 
boundary, we prescribe u? = 0, and u! is given by a parabolic inflow profile. 
The Reynolds number based on the length |AB| and the maximum inflow 
velocity is given by Re = 200. CD is an outflow boundary; the remainder of 
the boundary consists of solid walls. 


Upstream influence of outflow conditions 


The outflow conditions (at CD) are not known a priori. But nevertheless, 
we need to prescribe suitable conditions to make the problem determinate. 
Fortunately, there is a close analogy with the convection-diffusion equation, 
studied in Chap. 2-5. This simple equation allows a thorough understanding 
of many basic issues. In Sect. 2.5 the infiuence of outflow conditions was 
studied. Based on what we found there we may trust that the exact solution 
is influenced by the outflow conditions only in a small neigbourhood of CD 
of thickness O(Re~’), if Re >> 1. Furthermore, rapid local variation of the 
exact solution is avoided if a homogeneous Neumann condition is applied. 


Experiments with outflow conditions 


For the Navier-Stokes equations a homogeneous Neumann condition corre- 
sponds to zero tangential stress and prescribing the normal stress (i.e. approx- 
imately the pressure if Re > 1). This corresponds to boundary conditions 
(6.10): 


—-p+olt=-—p,, o71=0 onCD, 


where we assume the z!-axis parallel to BC. The resulting streamline pattern 
is shown in Fig. 6.3; it is in satisfactory agreement with results published in 
the literature, and seems physically reasonable. 


Just to show what may go wrong, we next prescribe parallel outflow: u? = 0 
at CD. To get a well-posed problem, a second relation has to be prescribed 
in relation to the equation for ut. This could be the flux through CD, or u! 
itself. Starting with the first possibility, we prescribe 


—p+olt'=-p,, u?=0 onCD. (6.43) 


We now have a Dirichlet condition for u?. We use the second order cen- 
tral discretization for the convection term. Keeping the analogy with the 
convection-diffusion equation in mind, we know that we now run the risk 
of spurious numerical wiggles, based on our experience with the convection- 
diffusion equation in Sect. 4.2. Spurious numerical wiggles indeed occur, as 
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Fig. 6.4. Streamlines for flow over a backward facing step. Outflow conditions: 
—p+oa! = pæ, u =0. 


shown in Fig. 6.4. They get worse when we also prescribe a Dirichlet con- 
dition for ut, namely a parabolic profile with the same mass flux as at the 
inlet, Again, u? = 0 at CD. The result is shown in Fig. 6.5. The results of 
Fig. 6.4 and 6.5 are unacceptable. 


As for the convection-diffusion equation, we see that with the central scheme 
the type of outfiow conditions has a large influence on the numerical results, 
and that the stress conditions (6.43) are to be preferred. With upwind dis- 
cretization, there would have been no spurious numerical wiggles in Fig. 6.4 
and 6.5, and the upstream influence of the outflow conditions would have 
been restricted to a few grid cells. But the length of the recirculation zone 
would have been too short, due to the numerical viscosity generated by the 
upwind scheme. 


An analysis of outflow boundary conditions is given by Johansson (1993). 


The chimney problem 


It may also happen that inflow conditions are unknown a priori. What we 
shall call the chimney problem is an example. Consider a vertical pipe with 
a smooth entrance nozzle (Fig. 6.6). The wall of the pipe is hot, so that the 
fluid flows upward under the influence of body forces. The flow is governed 
by the Boussinesq equations (6.2), (6.3). We assume that the Rayleigh num- 
ber is sufficiently large for the convection and inertia terms to be significant. 
This means that at the inflow boundary AB we must give Dirichlet bound- 


248 6. The incompressible Navier-Stokes equations 





Fig. 6.5. Streamlines for flow over a backward facing step. Outflow conditions: u’: 
parabolic profile; u? = 0. 


Cp----\D 


A} ----|B 


Fig. 6.6. The chimney problem. 


ary conditions to obtain a well-posed problem. This has been demonstrated 
in Sect. 4.2 for the one-dimensional convection-diffusion problem. However, 
the inflow velocity is not known a priori. This difficulty can be resolved as 
follows. We assume a smooth inflow pattern without local separation, which 
is realistic because there is a smooth nozzle in front of the inflow boundary 
AB. As a consequence, Bernoulli’s theorem holds approximately in front of 
AB. Furthermore, we assume parallel inflow with a constant velocity profile: 


u'=0, «u? = constant on AB. 
Bernoulli’s theorem gives 
1 
5 pu +P=Po on AB, (6.44) 


where po. is the pressure far away from the pipe. At CD the velocity is 
assumed to be vertical. Then the horizontal pressure gradient is zero, so that 
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p = Poo on CD. We apply therefore the outflow conditions (6.10), assuming 


the viscous stress to be negligible. By means of an iterative procedure, u? on 


AB can be determined. We make a guess u? = c°) on AB, solve and find p 
on AB, say p = p©). Then we determine c}? from 


1 
pee = po — pl, 


put ul?) = cl), solve, and so on, until (6.44) is satisfied with sufficient accu- 
racy. 


6.6 Temporal discretization on staggered grid 

We now discretize also in time, in order to obtain a fully discretized system. 
Spatial discretization on a staggered grid is assumed, so that (6.42) is our 
point of departure. 


General formulation on staggered grid 


As an example, the second order Adams-Bashforth scheme (5.74) is applied 
to (6.42), resulting in 


l 
u” — yr + ZTN (u”?) _ N(u"~?) + 3Gp"71! _ Gp"-*} 


= Snag?) — -3 (6.45) 
Du” = g(t"). 


In order to avoid an overdetermined system, u” and p*—! must be determined 
simultaneously. In order to obtain a general formulation that also applies to 


other time stepping schemes, we define p*~1/? = 3p"~1 — ip”-?, and write 
1 
u” — uty 5 T{3N(u") — N(u™~?)} + rGp 1/2 
= Frage) — Fe-)}, (6.46) 


Du” = g(t”). 


This formulation brings out more clearly than (6.45) the fact that the pres- 
sure acts as a Lagrange multiplier guaranteeing satisfaction of the continuity 
equation. 


As a second example, the Crank-Nicolson scheme is applied to (6.42), which 
gives 
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1 
u” — uty aT {N(u") +N(u"~")} +7rGpr-? 


= SAS) + FED), 
Du” = g(t”). 


As seen from these examples, multistep methods applied to (6.42) can gen- 
erally be written as 


Alu") + rGp?—? =r", 
(6.47) 
Du” = g(t") , 
where r” is known from previous time steps and the boundary conditions. 
For explicit methods, A is the identity I. The system (6.47) is an indefinite 
system for u” and p"—1/2, that can be solved iteratively with the methods 
to be described in Chap. 7 for the stationary case. However, less computing 
time is required if p*-1/? and u” can be solved for separately. To this end 
the following method has been devised, which is the method of choice for 
nonstationary problems. 


Pressure-correction method 


Equation (6.47) is not solved as it stands, but first a prediction u* is made 
that does not satisfy the continuity equation. Then a correction is computed 
involving the pressure such that the continuity equation is satisfied. The 
method is given by 


A(u*) + TGP P = r”, (6.48) 
u” — u* +7G(pr—'/? — p—3/2) = 9, (6.49) 
Du” = g(t”). (6.50) 


Equation (6.48) more or less amounts to solving discretized convection- 
diffusion equations for the predicted velocity components. We use the best 
available guess for the pressure, namely p?—3/2, Equation (6.49) is motivated 
by the fact, that if in the explicit case, where A is the identity, we eliminate 
u* from (6.48) and (6.49), then the original system (6.47) is recovered. The 
pressure can be computed by applying the operator D to (6.49) and using 
(6.50), resulting in 


1 
DGôp = —{Du* —g(t")}, pY? = p3? + bp, (6.51) 


After p"~1/? has been computed, u” follows from (6.49). 


Equation (6.49) can be regarded as a correction of u* for the change in 


6.6. Temporal discretization on staggered grid 251 


pressure. Therefore (6.48)—(6.51) is called the pressure-correction method. It 
is an example of a fractional step method, in which a time step is split up 
in sub-steps, and different physical effects are accounted for separately in 
the sub-steps. Here pressure forces are accounted for in the second sub-step, 
and inertia and friction in the first sub-step. Confusingly, the term pressure- 
correction method is often also applied to various iterative methods to solve 
the stationary Navier-Stokes equations, in which velocity and pressure up- 
dates are carried out not simultaneously but successively. Such methods will 
be encountered in Chap. 7, where they will be called distributive iteration 
methods. These should not be confused with the pressure-correction method 
used in time accurate schemes as formulated above. This method may also 
be called a projection method, because in (6.49) the new velocity u” is the 
projection of the intermediate velocity field u* on the space of velocity fields 
with discretized divergence equal to zero. 


Remembering that divgrad equals the Laplacian, we see that (6.51) looks 
very much like a discrete Poisson equation; it is frequently called the pres- 
sure Poisson equation. Note that no boundary condition needs to be invoked 
for 5p (fortunately, for no such condition is given with the original equations), 
because the boundary conditions have already been taken into account in the 
construction of D, G and g; the operator DG works exclusively on pressure 
values in grid points in the interior of the domain. The issue of boundary 
conditions for the pressure Poisson equation (which does not arise with the 
approach followed here) is discussed extensively in Gresho and Sani (1987), 
Gresho (1991), Gresho (1992). 


Even if the method is explicit (A = I), we still have to solve an implicit sys- 
tem for dp. This is an unavoidable consequence of the differential-algebraic 
nature of (6.42). 


As we remarked before, by elimination of u* it is easily seen that in the 
explicit case the pressure-correction method (6.48)—-(6.51) is equivalent to 
(6.47), and that this remains true if p”~3/? is neglected in (6.48) and (6.49). 
But in the implicit case this does not hold, and inclusion of a sufficiently 
accurate first guess, such as p"~°/?, for the pressure in (6.48) seems to be 
necessary to obtain full, i.e. O(r?), temporal accuracy. This may make it 
necessary to compute the initial pressure field at the starting step (n = 1), 
to be used instead of p~'/?. This may be done as follows. Application of D 
to (6.42) at t = 0 gives 


dg(0)/dt + DN(u(0)) + DGp(0) = Df (0) . (6.52) 
After solving p(0) from (6.52), we put p~1/? = p(0). 


In Cartesian coordinates, D is the adjoint of G, but as this is not the case in 
general coordinates, this is not be assumed here. 
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Discrete compatibility condition 


In case the pressure is not involved in any of the boundary conditions, it 
follows from the incompressible Navier-Stokes equations that the pressure is 
determined up to a constant. The system (6.51) for dp is singular in this case, 
and (6.51) has a solution only if the right-hand side satisfies a compatibility 
condition. The boundary conditions discussed in Sect. 6.2 are such that if 
the pressure is not involved in any of the boundary conditions, then the 
normal velocity component is prescribed all along the boundary, and the 
compatibility condition (6.11) is satisfied. Summing the discrete continuity 
equation (6.33) over all cells reduces, due to cancellation in the interior, to 
the following sum over boundary ponts: 


Jy+1/2,J2 1,J2+1/2) 
Drones + Doh orp — 9, (6.53) 
Fz 


where v is the prescibed velocity component at the boundary. If (6.53) is not 
satisfied exactly one should adjust v, which should not be difficult, because of 
the compatibility condition (6.11). If (6.53) holds, then it turns out that the 
elements of the right-hand side vector of (6.51) sum to zero, which is precisely 
the compatibility condition required for existence of solutions of (6.51). If one 
desires to make the solution unique one can fix ôp in some point, but iterative 
methods usually converge faster if one lets dp float. 


Temporal accuracy 


The pressure-correction method has been formulated and studied in Bell, 
Colella, and Glaz (1989), Chorin (1968), Chorin (1969), Chorin (1970), 
Dukowicz and Dvinsky (1992), Harlow and Welch (1965), Hou and Wetton 
(1993), Shen (1992a), Shen (1992b), Perot (1993), Temam (1977), van Kan 
(1986), including some rigorous convergence analysis. Indications are that the 
temporal accuracy of u” is of the same order as rhe order of accuracy of the 
underlying time stepping method (for example, O(r?) for Crank-Nicolson), 
but that the accuracy of p?~1/? is only O(r), irrespective of the time stepping 
method used. If one desires, a pressure field with improved accuracy can be 
obtained after u” has been computed (with the pressure correction method) 
by proceeding in the same way as in the derivation of (6.52), leading to the 
following equation for p"~!/?; 


dg(t")/dt + DN(u") + DGp"-/? = Df(t"), 


which very likely results in a pressure field p?—1/2 with the same order of 
temporal accuracy as the velocity field. 
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Stability 


For stability of (6.48)~(6.51), it seems necessary that (6.48) is stable. It is 
conjectured that this is sufficient for the stability of (6.48)-(6.51); numerical 
evidence supports this conjecture, cf. Chorin (1968). We restrict ourselves 
to Fourier stability analysis of (6.48). To this end (6.48) is linearized, the 
coefficients are taken constant (‘frozen’), the boundary conditions are as- 
sumed to be periodic, and the known source terms rGp*—3/2 and r” are 
neglected. Hence, carrying out Fourier stability analysis for (6.48) implies 
that the discretization of the following simplified and linearized version of 
(6.2) is considered: 


ðu” 
ot 





+ UP us — Re~’ (u + uf a =0, 
with L/? and Re constant. For further simplification we use (6.1), from which 


it follows that u? g = 0, and we obtain 


ĝu” 
Ot 





+U ue, — Reu% = 0. (6.54) 


Equation (6.54) consists of decoupled convection-diffusion equations, for 
which Fourier stability analysis is presented in Chap. 5. The computing cost 
of checking in every grid point whether a stability condition is satisfied is 
often not negligible, so in practice this is often done only every 5 or 10 time 
steps or so. 


We now present a number of methods that are frequently used in practice. 


Leapfrog-Euler 


We write 
N(u) = C(u)+ Bu, (6.55) 


where the nonlinear operator C(u) represents the spatial discretization of 
the convection terms, and the linear operator B is related to the viscous 
terms. Application of the leapfrog-Euler method (discussed in Sect. 5.8) to 
the velocity predictor version of (6.42) gives 


u* — uu"? + 2r{C(u”7!) + Bu" ~*} = 2r f(t"72), 


6.56 
u” — u" + Gp"! =0, DGp"-! = Du* — g(t"), (6-56) 


which differs slightly from the general form (6.48)-(6.51). The use of the 
neutrally stable leapfrog method often causes numerical oscillations, which 
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are commonly suppressed by the following smoothing operation, applied after 
u” has been computed: 


1 
uF = (1— yu") + zye" fu?) , (6.57) 


with the constant y typically between 0.05 and 0.1. The smoothing (6.57) 
is called the Asselin filter (Asselin (1972)). After smoothing, the boundary 
conditions are reimposed on u"~!. The global temporal accuracy seems to 
be O(r) for both velocity and pressure. Computation can be started with an 
explicit Euler step; in this step r does not need to be subjected to a stability 
restriction. 


This method has played an important role in the development of large- 
eddy simulation of turbulence (Arnal and Friedrich (1992), Boersma, Eggels, 
Pourquié, and Nieuwstadt (1994), Deardorff (1970), Eggels, Unger, Weiss, 
Westerweel, Adrian, Friedrich, and Nieuwstadt (1994), Friedrich and Unger 
(1991), Grotzbach (1982), Grotzbach and Worner (1994), Manhart and Wen- 
gle (1994), Mason (1989), Schumann (1975), Schmitt and Friedrich (1988), 
Wagner and Friedrich (1994), Werner and Wengle (1989). 


Usually the second order central scheme is used for spatial discretization, and 
the heuristic stability condition (5.78), put forward in Schumann (1975) and 
shown to be sufficient for von Neumann stability of the frozen coefficient ver- 
sion of the predictor step of (6.55) in Pourquié (1994), is commonly used. A 
weaker sufficient and necessary condition is given in Theorem 5.8.4, resulting 
in 


r < Re/ $ > h57(2 + V/4+ 9p2), Pa = |UalhaRe , (6.58) 


with q = 1 for the second order central scheme and q = 25/9 for the fourth 
order central scheme (in which case (6.58) is merely sufficient). 


Adams-Bashforth 


Application of the second order Adams-Bashforth scheme (5.74) to (6.42) 
gives equation (6.46). The pressure-correction method results in 


wut lg Srf3N(ut-4) — N(u"-?)} = FBE) = fE- 


u” — u" + Gp? = 0, DGp"~"/? = Du* ~ g(t”) ; 
(6.59) 


The global temporal accuracy of the velocity field is found to be O(r?) in 
practice. Accuracy results on p seem not to have been published, but at least 
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O(r) is expected. Because it has better temporal accuracy and does not need 
the Asselin filter, this method now seems to be superseding the leapfrog- 
Euler scheme. Examples of application can be found in Andersson, Billdal, 
Eliasson, and Rizzi (1990), Balaras and Benocci (1994), Benocci and Pinelli 
(1990), Gao (1994), Gavrilakis, Tsai, Voke, and Leslie (1986), Hoffmann and 
Benocci (1994), Kajishima, Miyake, and Nishimoto (1990), Kobayashi and 
Kano (1986), Kobayashi, Morinishi, and Oh (1992), Krettenauer and Schu- 
mann (1992), Kristoffersen and Andersson (1993), Schmidt and Schumann 
(1989), Shimomura (1991), Thomas and Williams (1994), Voke and Gao 
(1995). 


In practice one seems to proceed mainly by trial-and-error and by heuristics 
to determine a suitable size for the time step. But a dependable stability 
criterion may be deduced from Theorem 5.8.2, which gives for the «-scheme: 


T < min{71, 72, 73}, 
with 


1 -2 
n = gRe/ JO ha®{2+ (1 = n)Reltalha) , 


T = maxfqı(2/Re)"/ ($ u4/h) "5, q2V2/ J lualha} , 


a 





T3 = vile So hg? {2+ (1- 6)Relualha} 


-1/2 


» J u2{24 (1 - «K)Reļuaļha} ~? 


where we have been a bit more strict than (5.65) for simplicity, and where 
we have used the fact that parts of (5.61) and (5.65) are identical in this 
case. Here qı and q3 are given in Theorem 5.7.3. It is left to the reader to 
derive similar conditions for the fourth order central and first order upwind 
schemes, using Theorem 5.8.2 and (5.62), (5.66) or (5.63), 5.67), respectively. 


Crank-Nicolson 


Following ideas put forward in van Kan (1986), Bell, Colella, and Glaz (1989), 
application of the Crank-Nicolson scheme results in the following pressure- 
correction method: 
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1 1 
u* —ut 4 5 TIN (u") + N(u"-1)} + rGp"73/? = 5 TI F(t") +EH}, 
u” — u* +Gôp=0, 


DGôp = Du* — g(t") pri — prs? + dp ; 
(6.60) 


Unconditional stability for fixed r and ha is shown in van Kan (1986). Uncon- 
ditional von Neumann stability of the velocity predictor step of (6.60) follows 
from Theorem 5.8.10. As argued in Sect. 5.10, the multistage w-scheme is to 
be preferred over the Crank-Nicolson scheme in practice, because the lat- 
ter scheme is not strongly stable. But it is useful as an ingredient in IMEX 
schemes, treating the nonlinear inertia terms with an explicit method. Two 
examples are given below. 


However, the solution of nonlinear systems may also be avoided by lin- 
earizing the equation for u*, for example in the way suggested in van Kan 
(1986). Inspection shows that N(u*)+N(u"~!) consists of terms of the type 
e7y* +o? ly"! N being a discretization of (u%uf) g. In van Kan (1986) 
it is shown that without loss of temporal accuracy the following Newton lin- 
earization may be applied: ¢*y* + 6"~!y"—! = g*w"-1446"-1y". As a result 
N(u*) + N(u"~') is replaced by 2F(u"~!)u*, with E a linear operator, so 
that the velocity predictor step becomes linear: 


u* —u 4 E(u" )u" + 7Gp"-3/? = srtf(t”) +f(t"-")}. (6.61) 


Numerical evidence and theoretical considerations (van Kan (1986), Bell, 
Colella, and Glaz (1989)) suggest that the temporal accuracy is O(r?) for u 
and O(r) for p. 


Adams-Bashforth-Crank-Nicolson 


Less restrictive stability conditions than for Adams-Bashforth may be ob- 
tained if the viscous terms are treated implicitly with the Crank-Nicolson 
method, resulting in de Adams-Bashforth-Crank-Nicolson scheme. This re- 
sults in the following pressure-correction method: 


* n= ł n— n=- l n- * — 
u*— uP by 57 {3C(u 1) — C(u"2)} + z7Blu 1 4 u*) +rGp°~?/? 
= rf(t”71/2), u” — u" + Gdp=0, 
DGoép = Du” — g(t") , pr? = p” 73/2? + dp 
This method has been applied in Esposito (1992), Gavrilakis (1993), Huser 


and Biringer (1992), Kim and Moin (1985). The accuracy for both velocity 
and pressure is found to be O(r?) in a test example (Kim and Moin (1985)). 
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Suitable stability conditions follow from Theorem 5.8.6. Substitution of 6 = 
2//3 in (5.61), a = 1/2 and 6 = 3/4/,/2 in (5.69) results in the following 
stability condition, if the «-scheme is used for spatial discretization: 


r < maxri, min{r2, max(73, 74) }] , 


with 


a 
| 


— 8 -1 2 _i -2 -1 
z4sRe DD n=3/2 ha {2Re7! + (L—«)|ualh} , 
rs = q(3/Re) (9 ua /ha) 7, ta = qaV3/ $ ltal/ha , 


where q1, q2 are given in Theorem 5.7.3 and q3 in Theorem 5.7.5. In a similar 
way, Theorem 5.8.6 can be used to obtain stability conditions for spatial 
discretization with the fourth order central and first order upwind schemes. 


Runge-Kutta-Crank-Nicolson 


In Le and Moin (1991) it is proposed to replace Adam-Bashforth by Wray’s 
Runge-Kutta method (5.80) in order to reduce the stability restriction on 
the time step and the computing effort. For the convection-diffusion equa- 
tion the resulting method is given by (5.83). Straightforward application of 
the pressure-correction method, applying the continuity equation to the ve- 
locity field at every stage, would require three pressure corrections per time 
step, making the method expensive. However, the pressure is treated with 
Crank-Nicolson, so that just as for (6.60) one expects that O(r?) accuracy 
for the velocity is possible with just one pressure correction. In Le and Moin 
(1991) this is indeed found to be the case. The three stages of (5.83) are 
executed with the pressure from the previous time step and without invoking 
the continuity equation. Then a pressure correction step follows. This results 
in the following scheme: 


u* —u™ 14 oyrC(u"~!) + rB(aqu" + Bru") + (ar + B1)TGp?~3/? = fr , 
u** — u* + oo7C(u*) + CorC(u"*) + rB(agu* + Bou**) 
+ (a2 + b2) rGP P = fe, 
u*** — u** + agrC(u™*) + GgrC(u*) + rB(agu** + B3u***) 
+ (a3 + b3) rGP P = fre , 
u” — u*** + Gdp—0, 
DGép = Du*** — g(t”), pY? = p®-3/2 4 bp. 
In the implementation of the pressure terms we have taken into account that 


the effective time step per Runge-Kutta stage is (a, + Bk )r. The coefficients 
ax, Bk, ak and Çp are given by (5.84). The right-hand sides f*, f** and f*** 
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depend on the implementation of the boundary conditions and of body forces. 
For details on the treatment of time-dependent boundary conditions, see 
Kim and Moin (1985). If there is a body force s it can be treated with 
Crank-Nicolson or with Runge-Kutta. The latter option may be preferable if 
s depends on the solution. In that case the contribution to f**, for example, 
becomes o27s(t*) + Cg7s(t"~"), with t* = t"~1 + (a, + 6,)r. Wray’s Runge- 
Kutta method has been designed to economize on storage; only preceding 
velocity fields are needed to compute u** and u***. The accuracy of the 
velocity field is O(r7). 


Von Neumann stability of the three predictor stages leads with the use of 
Theorem 5.8.9 to the following stability condition: 


r < V3/{(2—K) X |ual/ha} - 


The multistage w-scheme 


The multistage w-scheme (5.110) with o and w given by (5.104) and (5.105) 
has better stability properties than the Crank-Nicolson scheme, because it 
is strongly stable. The computing cost is the same, because the cost of one 
stage is the same as the cost of one Crank-Nicolson step, whereas for the 
multistep w-scheme 7 can be three times as large as for Crank-Nicolson for 
the same accuracy, as illustrated in Sect. 5.10. Using the linearized version 
(6.61) in the multistage w-scheme, only linear equations have to be solved, as 
for the IMEX versions of Crank-Nicolson discussed above. But the matrices 
of the systems to be solved change in time. However, this is also true for 
the IMEX schemes if the viscosity changes with time, as would be the case 
for turbulent flow computations. Despite its attractive properties, use of the 
multistage w-scheme has been restricted to finite element methods (Glowin- 
ski and Périaux (1987), Bristeau, Glowinski, and Périaux (1987), Rannacher 
(1989), Rannacher (1993), Randall (1994), Turek (1994)), but probably it 


deserves to be used more widely. 


Application of the multistage w-scheme (5.110) as a pressure-correction 
method leads to the following scheme: 
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u* — u” + awrN(u*) + alwrN(u") +urGp"-*/? = wrf" , 

ut” — u* + Gdp=0, DGédp= Du* — g(t" +u7), 

prt”? = prvi? + dp, 

u* — u” t” 4 aw TN(u*) + aw TN (u”t”) +w rGprtel? = w rfrtice ; 
u”tl-€ — u* + Gôp=0, DGôp = Du* — g(t? +T- wr), 

prti = prtwl2 4 Ôp, 

u* — uti- 4 awN(u*) +a wN (uti) + wrap te??? = wr ft , 
ut? _y* 4 Gsp=0, DGédp = Du* — g(t"*?) , 

prti-w/2 — p12 4 ĝp. 


The right-hand sides are to be computed from the body force s (if any) 
and the velocity boundary conditions at appropriate times. For example, for 
f"t1—-” one uses s(t” + 7 — wr), the velocity boundary conditions on ut” 
at t” + wr and on u”, i.e. at t” + 7 —wr. Without loss of accuracy, linear 
systems may be obtained by Newton linearization, replacing quantities like 
é*y)* by, in the second stage for example, pynte + or te y* — gr teyrtye. 


The PISO method 


The pressure-correction method (6.48)-(6.50) is not the only way, of course, 
to uncouple velocity and pressure by operator splitting. It is easy to think 
of variations, but no clear advantages over the basic method (6.48)-(6.50) 
seem to accrue. Just to show one example of how things can be done a lit- 
tle differently, and to give the reader a flavor of the considerations followed 
by practitioners, the PISO (Pressure Implicit with Splitting of Operators) 
method (Issa (1986)) will be discussed briefly. Equation (6.47) is approxi- 
mated by the following steps: 


u* — u”! + 7N(u") + TGP P = rf(t) , (6.62) 
u” — u” ~l t rN (u*) + rGp T1? = rft”). (6.63) 


By applying the discrete divergence operator D to (6.63) and imposing the 
continuity equation we obtain the following pressure equation: 


DGp"1/? = {pun -~ g(t”)} — DN(u") + Df(t") . (6.64) 


After p®~1/ is solved from (6.64) it is substituted in (6.63) to determine u”. 
In (6.64) we do not replace Du"~! by g(t"—1), in order to take account of 
the possibility that Du"—! Æ g(t"—1), due to inexact solution of the pressure 
equation for p"~3/?; this is to avoid accumulation of errors. 


For von Neumann stability analysis in the same way as before, the pressure 
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terms and right-hand sides in (6.62), (6.63) are neglected. This makes (6.63) 
identical to (6.62), and we are left with the implicit Euler scheme, which is 
unconditionally stable. It also follows that the temporal accuracy of PISO is 
O(r), and this remains so if a second velocity-pressure correction is appended 
to (6.62), (6.63), as proposed in Issa (1986) (in order to improve the accuracy 
of the pressure, but found to yield no advantage in Issa, Gosman, and Watkins 
(1986)). To improve results for 7 large (aiming for the steady state solution), 
equation (6.63) is made implicit in the velocity, while taking care that velocity 
and pressure may still be solved for separately. One writes 


N(u) = Nou + Ni(u) , 
with No a diagonal matrix containing the contribution of the central element 


of the discretization stencil, suitably linearized. Equation (6.63) is replaced 


by 


(I+ 7No)u” — u”! + rNi(u*) + rGp i? = rp(t") . (6.65) 
It follows that 
u” = (I+ rN) Hu"! — rN (u*) — rG? 4 rft), (6.66) 


where the inverse of J + 7Npo is readily available, because No has been cho- 
sen diagonal. By invoking the continuity equation we obtain the following 
pressure equation: 


D(I + TN) Gp"? = 
-1 1 n—i * n 1 n (6.67) 
D(I + tNo)*{au"~" — Ni(u*) + f(t )} = = g(t") . 


n—-1/2 


Solution for p and substitution in (6.66) gives u”. 


Finally, the method is made more efficient by computing increments instead of 
approximations. The predictor equation (6.62) remains, The corrector equa- 
tion (6.65) is modified by subtracting (6.62). The pressure equation is ob- 
tained in the same way as before by invoking the continuity equation. This 
results in the PISO method as proposed in Issa (1986): 


(I + 7No)u* — u”! + 7Ny(u*) + rGp 3? = rft”), 
(I +rNo)(u” — u*) + 7G(p?— 1? — p—3/2) = 0, (6.68) 


D(I + tNo)*G(p"-/? — pr!) = “(Dut — g(t")}. (6.69) 


The advantage of solving for increments is seen to be, that the known terms 
in (6.68) and (6.69) are cheaper to evaluate than in (6.65) and (6.67). 


The temporal accuracy of PISO is O(r), and there seems to be no advantage 
over the methods discussed before. But as a time stepping method to compute 
stationary solutions using large 7 it is useful. It has much in common with 
iterative methods to solve the stationary equations that will be discussed in 
Chap. 7. 
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6.7 Temporal discretization on colocated grid 


Equation (6.31) is our point of departure. Choosing a temporal discretization 
method, a system of equations of the following type results: 


Afu”) + Gp? = pn (6.70) 
Du" + Bp? = g(t”) , (6.71) 


where r” is known from previous time steps and the boundary conditions. 
The system has to be solved iteratively. The residual of (6.70) needs to be 
reduced to a size commensurate with the temporal discretization error, but 
(6.71) has to be solved accurately, to ensure mass conservation. A general 
framework and a theoretical basis seem to be lacking; in this respect the sit- 
uation is more favorable for staggered grid methods. 


Successive iterands approximating u” and p"—1/? will be denoted by v™ and 
qg™-1, m = 1,2,... respectively. We choose q? = p*—3/?, One tries to avoid 
solving for v and q simultaneously. First, an iterative method that is remi- 
niscent of pressure correction is formulated (cf. (6.48)—(6.50)): 


A(v*)+7Gq™" =r", (6.72) 
v™ — v* +7rG(q™ —q™"') =0, (6.73) 
Dv™ + Bgo} = g(t”). (6.74) 


Application of D to (6.73) and substitution of (6.74) results in the following 
pressure equation: 


1 n m m- 
(Dv* + Bg™! —g(t")), qg™ =q + ôq . (6.75) 


After solving for q™, v™ follows from (6.73). Unlike the staggered grid 
method, we cannot stop after one iteration, writing u” = v!, pT? = q}, 
because the residual in (6.71) is still too large, since q lags behind in (6.74). 
This is a distinct disadvantage of the colocated scheme for time-dependent 
flows, compensated somewhat by the fact that there is no reason anymore to 


solve (6.75) accurately. 


In order to improve convergence, a PISO-like approach may be taken. We 
write 


A(u) = Aou + A (u), 


with Ao a diagonal matrix that may be chosen in various ways to enhance 
convergence. We write instead of (6.72)-(6.74) 
Aov* + Ay(v") +7rGq™ =r", (6.76) 
Aor” + Ai(v") + 7rGq™ =r", (6.77) 
Do” + Bq} = g(t"). (6.78) 
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Subtracting (6.76) from (6.77) gives 
Ao(v™ — v*)4+ 7Gbqg=0, ôq =q" -qg 
It follows that 
v™ = »* — rÁ 'Góq. 


Application of D and use of (6.78) results in the following pressure equation: 


1 
DAj*Géq = ={Dv" + Bq™~* — g(t")}. (6.79) 


After convergence, we put u” = v”, pr—t/2 


can be taken. 


= q” and the next time step 


Equations (6.72) and (6.75), or (6.76) and (6.79), or, in the case of coupled 
solution of u and p, equations (6.70) and (6.71), can be solved with itera- 
tive methods developed for the stationary case, to be discussed in Chap. 7. 
Stability of the temporal discretization method chosen for (6.70), (6.72) or 
(6.76) can be studied in the same way as for staggered schemes. Examples 
of time-accurate computation with colocated spatial discretization using the 
PWI method (or something similar) may be found in Armfield (1994) (us- 
ing Crank-Nicolson), Cantaloube and Lé (1992), Takemoto and Nakamura 
(1986), Takemoto and Nakamura (1989), Zang, Street, and Koseff (1994) 
(using Adams-Bashforth-Crank-Nicolson), Demirdžić and Perić (1990) (us- 
ing implicit Euler), Ho and Lakshminarayana (1993), Kost, Bai, Mitra, and 
Fiebig (1992), Kost, Mitra, and Fiebig (1992) (using BDF), and Lé, Ryan, 
and Dang Tran (1992) (using second order Adams-Bashforth). 


On the basis of what we have seen so far, for the computation of nonstation- 
ary flows, staggered discretization seems superior to colocated discretization, 
because of greater simplicity, efficiency and accuracy. But on general curvilin- 
ear grids, staggered spatial discretization is generally considered to be more 
complicated and less efficient than colocated discretization, which seems to 
be the more frequently used method on general grids. 


7. Iterative methods 


7.1 Introduction 


An in-depth treatment of iterative methods in linear algebra will not be given, 
because this is a huge subject, that would require a separate volume. Instead, 
only a bird’s eye view of the subject will be given, providing ample references 
to the literature, paying particular attention to difficulties that are peculiar 
to the incompressible Navier-Stokes equations. The uninitiated reader who 
wants to use these methods is advised to consult the references that we will 
quote. 


The algebraic systems that arise by finite volume discretization are extremely 
sparse, and also very large, because many grid points are required for accu- 
racy. Therefore iterative methods are more efficient and demand far less stor- 
age than direct methods, especially in three dimensions. In two dimensions, 
direct methods using sparse matrix techniques can still be useful. Hence, we 
will confine ourselves to iterative methods, with one exception: the equation 
for the pressure correction can often be solved very efficiently by so-called 
fast Poisson solvers, based on Fourier transformation and/or cyclic reduction. 


Efficiency and robustness 


When computing N unknowns, a method may be said to have optimal effi- 
ciency if the computing work W = O(N). This may be achieved by multigrid 
methods. The fast Poisson solvers mentioned above have W = O(N InN), 
which is also quite good. Generally, iterative methods have W = O(N%), 
a>. 


As the same time, the method needs to be robust (antonym: fragile), which 
means that the efficiency is insensitive to changes in the problem, such as 
variations in grid point distribution (especially cell aspect ratios), Reynolds 
number, flow direction or coordinate system (occurrence of mixed deriva- 
tives). In general, one should be wary of relaxation parameters that need to 
be adjusted to the problem at hand. Guidelines for improving robustness are 
given at the end of Sect. 7.2. 
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Obviously, in practice the turn-around time (i.e. the elapsed wall-clock time 
between problem input and completion of computation) is important. The 
turn-around time may be decreased by using a faster computer. The faster 
computers use parallel processing. An introduction to high performance com- 
puting in fluid dynamics is given in Wesseling (1996a). 


The reader is assumed to be familiar with the basics of iterative methods in 
linear algebra. An elementary introduction is given in Golub and van Loan 
(1996), Chap. 10. Classics devoted to the subject are Faddeev and Faddeeva 
(1963), Hageman and Young (1981), Varga (1962) and Young (1971). More 
recent works are Axelsson (1994), Bruaset (1995), Greenbaum (1997), Hack- 
busch (1994), Saad (1996). 


Software 


Much standard software is available, easily accessible on the Internet. Try for 
instance: math.nist.gov or www.netlib.org. 

This software can be used directly, or as examples or building blocks (tem- 
plates: Barrett et al. (1994)) for the development of customized software. 
Of particular interest are the following libraries: BLAS (basic linear algebra 
subprograms), FISHPACK (fast Poisson solvers for the pressure equation), 
ITPACK (iterative methods) and LAPACK (numerical linear algebra). 


7.2 Stationary iterative methods 


Let the linear algebraic n x n system to be solved be denoted by 
Ay =b. 
A stationary iterative method is defined as follows: 
yt? = By +c. (7.1) 


For consistency, we must have y = By + c, hence ¢ = (J — B)A~1b, so that 
(7.1) can be rewritten as 


My*ti = Ny +b, (7.2) 
with M—N =A (M = A(I— B)’, N = MB). So we see that every 
stationary iterative method corresponds to a splitting of A. Of course, M is 


chosen such that (7.2) can be solved with little work. We have 


B=- MA= MUIN. 
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The rate of convergence 


The error e* = y — y* satisfies 
e*t! — Be* , 
and convergence is governed by B, called the iteration matriz. We have 
llel] < BMI] {eel . (7.3) 
Since the spectral radius satisfies 
p(B) = lim ||B*||'/* (7.4) 


(cf. Hackbusch (1994), Theorem 2.9.8) we see that we have convergence if 
and only if 


p(B) <1. 


The smaller p(B), the faster convergence. This follows from (7.4), but be- 
comes especially clear in the case that B is normal, and the spectral norm is 


used. Then 
|B*|| = p(B) , 
and (7.3) becomes 
ile || < p(B)* |eP It - (7.5) 


This inequality is sharp, since equality is obtained in the special case that eo 
is an eigenvector belonging to an eigenvalue with |A(B)| = p(B). 


When to stop? 


We will not present a thorough discussion of termination criteria for iter- 
ative processes. Good termination criteria are usually problem-dependent. 
Our main purpose here is to show that the difference between two successive 
iterates usually does not give a good indication of the precision achieved. 


Let B have a single dominating real eigenvalue A = p(B), with corresponding 
eigenvector x. Then we have after many iterations 


E arr, (7.6) 
with a some constant. It follows that 


ys th — yk = ad (A lx = (A- De®, 
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hence 


ka Y 

= TT 
We see that if A = 1-e, 0 < € € 1, as frequently happens (slow convergence), 
then the error is much larger than the difference between successive iterations. 
If the assumption that B has a single dominating eigenvalue is satisfied, then 
an error estimate can be based on (7.7). To this end we need to estimate À, 
which can be done by 


A S |fy*tt — y" l/l — yO YL. (7.8) 


Often, it is better to derive a termination criterion from the residual 
rë = b — Ay* instead of from the difference between two successive iterates. 
After k iterations we have solved exactly the following neighboring problem: 


€e 


(7.7) 


Ay* =b- r", 


and frequently the perturbation of the right-hand side has a physical inter- 
pretation which enables one to decide what size of ||r*|| is tolerable. 


Regular splittings and M- and K-matrices 


We will now discuss an important class of matrices for which it is easy to 
obtain convergent stationary iterative methods of the type (7.2). All matri- 
ces to be considered have dimension n x n, and by A > 0 we will mean 
aj; > 0, 24,7 = 1,..., 7. 
Definition 7.2.1. The splitting 

A=M-N 


is called regular if M~' > 0 and N > 0. The splitting is called convergent if 
(7.2) converges. 


Definition 7.2.2. (Varga (1962), Definition 3.3). The matrix A is called an 
M -matriz if aij < 0, i Æi, i,j =1,...,n, A is non-singular and A`! > 0. 


Theorem 7.2.1. A regular splitting of an M -matriz is convergent. 


Proof. See Varga (1962) Theorem 3.13. 0 


Hence, it is worthwhile to try to discretize such that A is an M-matrix, be- 
cause then convergent iterative methods are easy to obtain. As discussed in 
Chap. 4, it is also desirable that the discretization is of positive type (Defini- 
tion 4.4.1), in order to exclude spurious wiggles. Fortunately, discretizations 
of positive type result in M-matrices. This we will now show. 
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Theorem 7.2.2. Ifan > 0, ajj < 0, 7 = 1,...,2, 
A is an M-matriz if and only if p(B) < 1, B= 
C= D-A. 


J 
D-1C, D = diag (A), 


Proof. See Young (1971), Theorem 2.7.2. o 


A matrix has weak diagonal dominance if 
|ass| > ) la;;|, i= 1, eun, 
jJ#t 


with strict inequality for at least one i. A matrix A called irreducible if the 
system Ay = 6 does not consist of subsystems that are independent of each 
other. Of course, this is the usual case in practice. If not, one proceeds with 
the independent subsystems separately. 


Theorem 7.2.3. If A has weak diagonal dominance and is irreducible, then 
det(A) #0 and a; $0, 1=1,...,n. 


Proof. See Young (1971), Theorem 2.5.3. o 


Theorem 7.2.4. If A has weak diagonal dominance and is irreducible, then 
p(B) <1, with B defined in Theorem 7.2.2. 


Proof. (see also Young (1971) p. 108). Assume p(B) > 1. Then B has an 
eigenvalue u with |u] > 1. Furthermore, det (Q) = 0, Q = I — p~*B. Since 
A is irreducible, so is Q. Since |u~"| < 1, Q has weak diagonal dominance. 
From Theorem 7.2.3, det (Q) # 0, so that we have a contradiction. o 


The foregoing theorems allow us to formulate a sufficient condition for A to 
be an M-matrix, that can be verified simply by inspection of A. 


Definition 7.2.3. A matrix A is called a K-matriz if 


Qi > 0, i=l, aun, 


aij < 0, j=l, n, jFt, 


and 


X aj >0, i=l, esn, 
j 
with strict inequality for at least one 2. 


Theorem 7.2.5. An irreducible K -matriz is an M-matriz. 
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Proof. According to Theorem 7.2.4, p(B) < 1. Then Theorem 7.2.2 gives the 
desired result. oO 


It is easy to see, taking the boundary conditions into account, assuming that 
we have a Dirichlet or a Robin condition at least on part of the boundary, 
that a discretization of positive type (Definition 4.4.1) results in a K-matrix. 
This matrix is usually irreducible, otherwise the domain can be decomposed 
into subdomains that can be solved for independently. Hence we have an 
M-matrix. 


If the discretization is of higher order, it cannot be of positive type (The- 
orem 4.4.2), and does not result in an M-matrix. Then one can use defect 
correction (Sect. 4.4) on a scheme of positive type. In each step of defect cor- 
rection, one has to deal with an M-matrix, so that iterative solution methods 
have favorable convergence properties. 


Regular (and hence convergent, Theorem 7.2.1) splittings are easy to obtain 
for M-matrices: 


Theorem 7.2.6. Let A be an M-matriz. If M is obtained by replacing some 
elements aij, j £ i by bj; satisfying aj; < bij < 0, then A= M-N isa 
regular splitting. 


Proof. This theorem is an easy generalization of Theorem 3.14 in Varga 
(1962), suggested by Theorem 2.2 in Meijerink and van der Vorst (1977). 
CJ 


Well-known examples of regular splittings are (block-) Jacobi and (block-) 
Gauss-Seidel methods, for which M is obtained by replacing certain elements 
aij by zero. Another class of methods is generated by incomplete LU factor- 
ization (ILU) in which one chooses 


M = LU , 


with L and U sparse and easy to compute lower and upper triangular matri- 
ces. Examples and references to the literature are given in Axelsson (1994), 
Bruaset (1995), Hackbusch (1994), Wesseling (1992). For A an M-matrix, 
existence of L and U, numerical stability of the associated algorithms and 
regularity of the resulting splitting has been proven in Meijerink and van der 
Vorst (1977). 
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Acceleration of stationary iterative methods 


In general, stationary iterative methods converge so slowly, that the ideal for 
the computing cost W = O(N) is far from reached. For example, for the 
standard 5-point finite difference discretization of the Laplacian on the unit 
square with uniform step-size h in both directions with Dirichlet boundary 
conditions, we have 


p(B) = 1—O(h?) (7.9) 


(cf. Hackbusch (1994), Sections 4.7 and 5.7) for methods of Jacobi and Gauss- 
Seidel type. Requiring an error reduction of O(h?) (commensurate with the 
discretization error) the number of indications m must satisfy (cf. (7.5)) 


p™ (B) = O(h?). (7.10) 
From (7.9) and (7.10) it follows that 
m = O(Inhk/Inp(B)) = O(—-h7? lnh). 


Since the work per iteration is O(N) (N the number of unknowns) and 
h = N-1/2 we find that the total computing work satisfies 


W=O(N*InN), a=2. (7.11) 


By introduction of relaxation parameters it turns out that a may be brought 
down to a = 3/2 (provided the optimal relaxation parameter can be deter- 
mined, which is difficult in general), but in general the computing work of 
basic iterative methods cannot be reduced further. However, these methods 
remain important, because they lend themselves to acceleration. This opens 
up the possibility to bring a down further and perhaps also to eliminate the 
factor In N in (7.11). There are two classes of acceleration methods that when 
combined with a suitable stationary iterative method result in the most ef- 
ficient and robust methods available at present. These are Krylov subspace 
methods and multigrid methods. The stationary iterative method is usually 
called preconditioner if a Krylov subspace method is used for acceleration, 
and is called smoother if multigrid is used. 


Robustness 


Robustness is only achieved with accelerated stationary iterative methods, if 
the preconditioner or smoother is suitably chosen. Guidelines can be derived 
from analysis of their performance for the convection-diffusion equation and 
the rotated anisotropic diffusion equation: 


—E(Y 11 + ~,22) + CU, + suU? = 0, (7.12) 
~ (ec? + sp1 — 2(e — l)esu 12 — (es? + c?)u a2 =0, (7.13) 
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where s = sin 8, c = cos. There are two parameters to be varied: € > 0 
and 8. When discretized on a uniform grid, these equations model the effect 
of varying the Reynolds number, flow direction, cell aspect ratio and non- 
orthogonality of the coordinate system (occurrence of a mixed derivative). 
When efficiency is good for all £ and 3, a method is called robust. Con- 
clusions drawn from analysis of (7.12) and (7.13) carry over heuristically to 
more complicated systems, such as Navier-Stokes. Efficiency and robustness 
of many stationary iterative methods as smoothers for multigrid is inves- 
tigated in Wesseling (1992) Chap. 7. It is general experience, that a good 
smoother for multigrid is also a good preconditioner for Krylov subspace 
methods. The following methods are found to be robust in two dimensions: 


— Damped alternating line Jacobi; 

— Alternating symmetric line Gauss-Seidel; 
— Alternating damped zebra Gauss-Seidel; 
— Certain ILU methods. 


See Wesseling (1992) for remarks on robustness in three dimensions. 


Exercise 7.2.1. Derive equations (7.6) and (7.8). 


7.3 Krylov subspace methods 
Basic idea 


The system to be solved is denoted by Ay = b. When A is large and sparse 
it is attractive, because of efficiency and simplicity, to use A only to multiply 
with. This means that we can build polynomials in A. A general form of 
possible algorithms is 


yh =y amp, p™=Om(A)r®, r°=b— Ay, 


with Ôm an arbitrary polynomial of degree m, and a,, some real number. 
The special choice of r° as vector to be multiplied by 6,,(A) is made because 
we want to have p° = 0 in case y° is the exact solution. We have 


r” = b— Ay” =b- Ay"! — am-14p™7! 
= rT! — am-1Abm-1 (A)r? (7.14) 
= po_ A{Qm—19m—i(A) + Am—20m—2(A) +... + a@o6o(A)}r° 
bm(A)r° , 


where ¢,, is a polynomial degree m with the following property: 


¢m(0)=1. (7.15) 


H 
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Since 
y” = A-*(b— r°) + {am 19m-1(A) + Om—29m—2(A) + ...a1090(A) }r® , 
we have 
y” — y? = {am—19m—1(A) + Om—29m—2(A) +... + a090(A) }r® . 
We see that 
y” —y E Km(A,r°), 
with the m-dimensional Krylov subspace Km defined as 
Km(A,r°) = span {r°, Ar?, ..., A" r°}. 


Methods that look for optimal approximations to y — y? in Km(4, r°) are 
called Krylov subspace methods. Introductions may be found in Bruaset 
(1995), Golub and van Loan (1996), Greenbaum (1997), Hackbusch (1994), 
Hageman and Young (1981), Saad (1996). 


There are various ways to specify optimal approximations. Let ||- || be some 
norm. Because of (7.14) and (7.15) we would like to construct ¢,, such that 
llém(A)r°|| is as small as possible, under the constraint ¢,,(0) = 1. More 
precisely, let us define the following set of polynomials: 


1 
I] = {¥(z) : (0) = 1, is polynomial of degree < m}. 
We want to construct dm E IL, such that r™ = dm(A)r° satisfies 


Ir” = min{||p(A)r"|| 7b € TT}. (7.16) 


Equivalent formulations of this optimal approximation property are obtained 
as follows. Equation (7.15) shows that 


r” =r°— v, v € Vm = span {Ar®,..., A™r?} , 
and (7.16) is equivalent with 


[|r"|| = min{{|r° — v|| : v € Vm} = min{||r° — Az|| : z € Km(A,r°)}. 
(7.17) 


Hence 
[A(y ~ y™)|| = min{||A(y — yo = z)|]: 2 € Km(4,r°)}, 
which gives the following formulation of the optimality property: 
|A(y — ¥™)I| = min{}[A(y — w)|| : w — yo E€ Km(A,r°)} . (7.18) 
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Rate of convergence 


For a general matrix A the convergence behavior of Krylov subspace methods 
can be quite erratic and hard to predict, see Greenbaum, Ptak, and Strakos 
(1996). When A is symmetric we can say more, however. If A is symmetric 
there exists a unitary matrix Q (i.e. QTI = QT) such that 


QTAQ=A, A=diag(\y,...,An), Ax Teal. 
Until further notice we use the spectral norm. A unitary matrix Q satisfies 


IQI = 1. 


For Krylov subspace methods for symmetric A the convergence behavior 
depends on all eigenvalues of A. This follows from the following theorem. 


Theorem 7.3.1. Rate of convergence of Krylov subspace methods 
Let the nxn matriz A be symmetric with eigenvalues d4,..., An. Let r™ satisfy 
(7.16). Then 


Ilr || < [[r°l minfmax{y(A) : à € o(A)},d € TI} (7.19) 


where o(A) = {Aq,..., An}. 
Proof. We have (A) = Qy(A)Q7?. From (7.16) it follows that 


Ir" || 


1 
min{||Qv(A)Q-*r°||: 0 € TT} 


A 


< İl’ HQI |Q-*|[min{[v(A)|] y € TT} 


llr°]| min[max{|p(A)| : A € o(4)}, y € TI]. 


Note that it follows from (7.19) that convergence is monotone: 
[Ir <r?" . 


It remains to find efficient methods that generate residuals satisfying (7.16). 
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Preconditioning 


From Theorem 7.3.1 it follows that it depends on the distribution of (A) 
how large m must be for ||r™|| to be small. If the eigenvalues are clustered 
around a point x € R, then already |[r+|| is small, since we have 


min | max{|W(A)|: A € o(A)}: y e TI] | < max{|1 — å/z|: à € o(A)}. 


Preconditioning means changing the original problem Ay = b such that the 
spectrum of the new problem is more clustered. The preconditioned problem 
is 


Aj=b, A=CrACR, ¥=CR'y, b=Crb. (7.20) 


Cy and Cp are called the left and right preconditioner, respectively; applica- 
tion of Cr is also called postconditioning. We may have Cz = I, or Cr = I, 


or both Cz and Cpr £ I. The aim is to have o( A) much more clustered than 
a(A), giving much faster convergence of Krylov subspace methods. 


Suitable preconditioning methods may be derived from stationary iterative 
methods. As discussed in Sect. 7.2, these can be written as 


Myt = Ny" +b, 


where y**! can be evaluated at little cost, i.e. evaluation of M~!z is cheap. 
We can take Cz = M-! or Cp = M~t. With ILU methods (cf. Sect. 7.2) we 
have M = LU, and we can take Cg = L7!, Cr = UTL. Another possibility 
is to take Cr = M7!, Cr = My’, with Mı coresponding to a forward and 
Mz corresponding to a backward Gauss-Seidel method. For remarks on the 
robustness of such preconditionings, see Sect. 7.2. It is preconditioning that 
makes Krylov subspace methods efficient. One may say that preconditioning 
accelerates Krylov subspace methods, or, equivalently, that Krylov subspace 
methods accelerate stationary iterative methods. 


The GCR method 


Of course, in computational fluid dynamics we have to deal with nonsym- 
metric matrices. The GCR (generalized conjugate residuals) method finds 


for general A the optimal Krylov subspace approximation in the sense of 
(7.16), (7.17) and (7.18). 


We start from (7.17). Noting that V,, is a linear subspace, we can decompose 


r° uniquely as 


O pO, ,0 0 0 
re=tTytry, rye Vm, Po LVn. 
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Let us define orthogonality in the sense of the Euclidean inner product (.,.): 
r Lq (r,q4)=0. 
We write, with v € Vm, 
lr? — oly = (ry = v + rps ro — v +rp) = (ro — v, r9 — v) + (r8, r9), 


with |} - ||2 the Euclidean norm. Obviously, (7.17) is solved by v = r?, which 
implies that 


r” LVm. 


This property enables us to determine r+, ..., r'™ and y!, ..., y™. Let {v}, ...,v™} 
be an orthonormal basis for Vm. Then 


mi 
M e ,,0 0 _ ,,0 cayd a (pO pi 
r? =r -r =r -5 ajv, aj= (rv). 
1 


This holds for m = 1, 2, .... Note that a; does not depend on m. We can write 


aj = (r° — YO (r°, oF Jv", vt) = (r971 w), 


so that 


This shows what the residuals should be. Since y” = A`!(b-— r™), the 
corresponding iterands satisfy 


y” — y™”-1 -+ (rL v™)s™, g™ = A-~ly™ ; 


It remains to determine v',...,v” and s!,...,5” (avoiding the use of A~). 
This can be done with the Gram-Schmidt orthogonalization process. Given 
an orthonormal system v',...,v”—1, and given si = A710), j = 1,...,m—], 
the following algorithm is executed to find v™ and s™ such that {v!,..., v™} 
is orthonormal and s” = A7!y™; 


Gram-Schmidt algorithm 


Choose s™; 
v” = As™; 
for j = 1,...,m—ido 
a= (v™, vI); v™ = v™ — avi; 5™ = s™ — asi; 
end for 


vm = v™ /llv™ lle; s™ = s” /|lv™ IIo; 
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It remains to choose s™. In a Krylov subspace method this should involve 
multiplication by A, so that we search for y™ in all of Km, in order to profit 
from the optimality property (7.18). A suitable choice is s” = r™~!, leading 
to the GCR method as proposed originally in Eisenstat, Elman, and Schultz 
(1983): 


GCR method 
Choose y; r° = b— Ay®; m= 0; 
while stopcriterion not satisfied do 


m=m+ l; 
Choose s™; \\ For example: s” = r™~! 
v” = As”; 


for j = 1,..m— 1 do \\ Gram-Schmidt 
a = (v™, vi); v™ = v™ — avi; s” = s” — asi; 
end for 
vm = v™/|lv™|[25 s” = s/e]; 
y™ — yr} + (rroi y™)s™; rm — pri _ (rrt, v™)y™; 
inspect stopcriterion 
end while 


Computing work 

The vectors v!,...,v™ and s!,...,5™ have to be stored, and the work for 
Gram-Schmidt grows quadratically with m. Therefore, in practice, after a 
certain number of iterations, m say, the process is restarted or truncated. In 
a restart v!,...,v™ and s!,...,s are removed, and the algorithm is restarted 
with y? = y”. With truncation, we allow only m vectors sî and vf, and 
replace an old vector by a new one. Of course, the optimality property (7.18) 
gets lost when restart or truncation takes place. 


Robustness and convergence 


GCR satisfies the optimality property (7.16) (and, equivalently, (7.17) and 
(7.18)) with ||-|| = |]-||2. It can be shown that the only way in which GCR can 
lose the optimality property (apart from restart or truncation) is that r"~1 = 
r™—2_ In that case one way choose s” = As™~—!, in order to search in the 
full space K,,. The rate of convergence can be estimated by Theorem 7.3.1, 
if something is known about o(A), and if A is symmetric. Inspection of the 
GCR algorithm shows that it can break down only if ||/v™|/2 = 0. It can 
be shown that this can only happen if y”~! = y, ie. the exact solution is 
reached. From (7.17) it follows that convergence is monotone. We see that 
GCR is very robust. For efficiency, GCR usually needs to be preconditioned. 
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Preconditioning 


The best choice for s” in the GCR, method is to make it equal to the error: 
s™ = A7!r™-1 so that y™ is exact. This is not practical, of course, but we 
could determine s™ by 


Solve approximately: As” =r™—!, (7.21) 


As approximate solution method one can take, for example, an iteration with 
a stationary iterative method based on a splitting A = M — N, determining 
s™ by 


s" — M-1,m-1 ; 
It is easy to see that this results in 
y” — y? E€ Km(M7'A,M7'r®), r” € K,(AM—!,r°) , 


so that we can say that the stationary iterative method is used as precondi- 
tioner for GCR, or that GCR is used to accelerate the stationary iterative 
method. The optimality property (7.16) and Theorem 7.3.1 hold with A re- 
placed by AM~!. Of course, the method used for (7.21) can be anything; it 
may be chosen differently for every m, and it may be nonstationary. Then 
(7.16) and Theorem 7.3.1 do not apply, of course. In van der Vorst and Vuik 
(1994) it is proposed to use GMRES (to be discussed hereafter) for (7.21); 
the resulting method is called GMRESR. 


The GMRES method 


The GMRES (generalized minimal residual) method, proposed in Saad and 
Schultz (1986), generates the same iterates as GCR, but is cheaper. We will 
describe the method using both pre- and postconditioning, so that we solve 


Ay=b, Ä=CLACr, 9=CR ly, b= Cyd. 
The GMRES method computes 
g=9 +z”, z” €K,,(A,?°). 


We have 7?” = 6 — Ag = f° — Az™. GMRES finds z™ such that (7.17) holds 


in the ||- ||z norm: 
Flo = min{||F° — Az|lo, z € Km(A,7°)}. (7.22) 


The principles of the method are as follows (cf. Bruaset (1995), Saad and 
Schultz (1986)). Let {v}, ...,v} be an orthonormal basis for K,,(A, F°), and 
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let Vin be the nx m matrix with columns vt, ..., v";n xn are the dimensions of 


A. Then z™ = V,,u for some u € R”, and we have to minimize [F2 — AVin ule. 
Define 


hij (Av), v'), t= l, voy ds j = 1, eM, 


j 
hj4ij || Av? -Ý hyv‘ lle, j = l,m. 
1 


The (m + 1) x m matrix Hm+1,m with entries hj; is upper Hessenberg. It 
turns out that 


ÅVm = Vm+1Hm+1,m ' 
We can write 7° = Bv', B = ||F||2, so that VI,,7° = Bel, with (e1)? = 
(1,0,...,0) E R™t!. We have 
7° = AVmull2 = |l8v: — Vinti Hm+1,mull2 
= ||[Vin41(Ger — Hm+1,mu)|l2 
(Vin41 (Ber — Hm41,mu), Vine1 (Ber — Hmgi,mu)) 
(V41 Ving1(Ber — Hm41,mt), ber — Hm41,mu)? 


= |\Ber — Amyimulle , 


Il? — Azle 


since VZ 41V¥m41 is the identity. Hence, the n-dimensional least squares prob- 
lem (7.22) has been replaced by the following much smaller (m-dimensional) 
least squares problem: find u € R™ such that 


||8e* — Amtimull2 = min{||Be* — Hm+1,mu||2 : u E R”}. (7.23) 
The preconditioned GMRES method can be outlined as follows. 


Preconditioned GMRES method 
Choose y°; r° = b — Ay?; 
s° = Cir’; B= |Ir°|l25 vt = s°/8; j = 0; 
while stopcriterion not satisfied do 
j= jth 
w = C,ACrv: 
for k = 1,..., j do 
hkj = (v*, w); 
end for 
hj+ig = lwll; v+ = w/hj+ij; 
end while 
Solve (7.23), 
y™ = y? + CrVmu; 
The least squares problem (7.23) can be solved in the classical way, as sug- 
gested in Saad and Schultz (1986), by computing the QR factorization of 


278 7. Iterative methods 


Hm+1,m using plane rotations. The work involved is negligible if m & n. 


Any stationary iterative method can be used as preconditioner. For example, 
as noted in Sect. 7.2, if A is an M-matrix, incomplete LU (ILU) factorizations 
exist: 


A=M-N, M=LU, 


and we may simply take Cz = L~'!, Cp = UTL. ILU has proved to be very 
effective as preconditioning for Krylov subspace methods. For more infor- 
mation on ILU, see Chap. 3 of Barrett et al. (1994) and references quoted 
there. 


Robustness and convergence 


Since the minimization property (7.22) of GMRES is the same as the mini- 
mization property of GCR, the convergence behavior of the two methods is 
identical, apart from rounding errors. But GCR allows the use of a variable 
preconditioner in (7.21), which, as mentioned before, is taken advantage of 
in the GMRESR, method (van der Vorst and Vuik (1994)). Obviously, GM- 
RES requires less work and storage per step than GCR. A step of GMRES 
requires only one multiplication by A, and one iteration with the precondi- 
tioner. But in addition, the orthogonal basis and the matrix Hm+1,m have 
to be constructed, and (7.23) has to be solved. The latter task is negligible 
ifm <n. For a particular implementation, in Saad and Schultz (1986) it is 
estimated that construction of the basis and computation of y” from u takes 
m(m + 2)n+ mn multiplications, with ñ the number of non-zero entries in 
A. Furthermore, storage of {v}, ..., v™t1} takes (m+ l)n reals. As a conse- 
quence, in practice one is restricted to m & n. If the desired accuracy is not 
reached, one puts y? = y” and GMRES is restarted. Of course, this destroys 
the optimality property (7.22). Unlike GCR, there is no simple way to use 
truncation with GMRES. 


The efficiency of GMRES, GMRESR and Bi-CGSTAB (to be discussed later) 
is compared in van der Vorst and Vuik (1994), Vuik (1993). 


The conjugate gradient method 


In the special case that A is symmetric positive definite (SPD), the mini- 
mization problem (7.16) can be solved by the conjugate gradient method (CG 
method) in a much cheaper way than with GCR or GMRES. About the 
only SPD matrix to be encountered in computational fluid dynamics is the 
pressure matrix in orthogonal coordinates, in which case fast Fourier/cyclic 
reduction Poisson solvers may be applicable, and if so, are to be preferred. 
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But if not, CG methods are a good choice, so they will be discussed. 
Because A is SPD, A!/? exists. If the norm is chosen as follows: 
lri? = (471r, r) = A7 Prl, (7.24) 


with (-,-) the Euclidean inner product, then the following CG method solves 
(7.16): 


Conjugate gradient method 
Choose y?; p-! = 0; r° =b— Ay®; p-1 = 1; m= -—l; 
while stopcriterion not satisfied do 


m=m+ l; 
Pm = (r™, r”); Bm = Pm/Pm-1; p” =r” + Bmp” !; 
am = (Ap”, p”); Am = Pm/Tm; Yt! = yY” + amp; 


pmtl p” — ay, Ap™; 
Verify stopcriterion; 
end while 
We will now show that this algorithm solves (7.16). 


Theorem 7.3.2. Optimality property of conjugate gradient method 
The conjugate gradient method gives a residual r™ = b— Ay™ that satisfies 


1 
Ilr" || = min{||d(A)r"|] y € TL}, 


where ||r|| = |[A71/?r[2. 
Proof. That r™ = ¢dm(A)r® with ọm € IL. is easy to see. We have 

I (A)r’ l? = (ATT B(A)ro, v(A)r®) - 
Let y € [[}, be arbitrary. Then we can write 

PIE) = Om(E) + &Xm-1(6) 
with Xm-1 a polynomial of degree < m — 1. It follows that 
IA I? = lem (A)r? + []Axm—1(A) |]? + 2(x%m—-1(A)r°, bm(A)r®) . 

We have proved the theorem if we can show that 

(Xm—1(A)r°, dm(A)r°) = 0, Vxm-1- (7.25) 


We have rë = ¢4(A)r°. We assume a1, 02, ...,m Æ 0 (if not, we have stum- 
bled upon the solution: p, = 0 for some k < m, rk = 0 and (7.16) is obviously 
satisfied). Assuming A has full rank, r°,...,r™ are linearly independent, and 
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span (r°,...,7) = Km(A,r°) . 
Since X¥m(A)r° € span (r°, ..., r™), (7.25) is satisfied if 
(rë, r”) =0, k<m. (7.26) 
Hence, it suffices to establish (7.26). Using induction, assume 
(ps, Ap”) =0, k<m-1 (7.27) 
and 
(rf r) =0, kom. (7.28) 
For k < m — 1 we have 
(p*, Ap™) = (p*, Ar™ + Bm Ap” 1) = (Ap*, r™) = (rë — rët, r™)/ak = 0. 
Furthermore, 
(p™—", Ap™) = (pP? Ar™ + Bm Ap™—*) = (Ap™71, r™) + Bnom=1 
= (rt — r™ ™) fam 1 + Bmom-1 = 0, 
so that (7.27) holds with m increased by one. Continuing with (7.28), for 
k < m we have 
(rF tl) = (rë, r” — am Ap™) = —am(r", Ap™) 


= —Üm (p* — Brp t, Ap™) =0. 


Furthermore, 


(r7, rTI) = pm — am(r™, Ap™) = pm — am(p™ — Bnp™ 1, Ap™) 


so that (7.28) holds with m increased by one. Establishing (7.27) and (7.28) 
for m = 1 and m = 2 is left to the reader. 
m 


Definitions of the norm other than (7.24) result in other variants of the CG 
method. The fact that the search directions are conjugate with respect to A 
(equation (7.27)) gives the method its name. 
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Convergence and work 


The rate of convergence can be estimated by Theorem 7.3.1, which applies 
because of Theorem 7.3.2 with the corresponding norm. 


In general, o(A) is unknown. An upper bound on the number of iterations 
required that uses only the condition number is given by the following theo- 
rem. 


Theorem 7.3.3. Rate of convergence of the CG method 
The residuals generated by the CG method satisfy 


[Ir [I/[lr°ll < 2exp(—2m/yx) , 
with x the spectral condition number of A, and |\r|| = ||A7}/*rll2. 
Proof. The eigenvalues of A are Ay < A2 <... < An, Ai > 0. Define 


A(z) = EAr +n) + (An M). 


bo | — 


Then 
{Ai e An} C {A(z}: -lL<z< 1}. 


From Theorem 7.3.2 it follows that 
1 
Ilr || < [rl] min | max{|dm(A(z))| : z €[-1, 1}: dm € II 


It is known that the sharpest bound is obtained by 
w(A(z)) = p(z) = Tm(2)/Tm(20); zo = (Ar + An)/(An — Ar); 
with Tm the Chebyshev polynomial of degree m. This gives 
Ilr™ I /Ilr°l| < max{|o(2)| : z € [-1, 1} = 1/Tm (z0) . 
Tm has the following representation: 


(2+ VP 1" + e+ VPI", 


The spectral condition number is x = An/A1. We have 
K+] 


_ >, _vKt1 
70= Tq? tyz L= Te? 


Tm(z) = 
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Obviously, a step with CG is much cheaper than a step with GCR or GMRES. 
There is no Gram-Schmidt orthogonalization. Each step requires only 3 vector 
updates, two inner products and one matrix-vector multiplication, requiring 
O(sn) work, with s the number of nonzero diagonals in A. Furthermore, only 
A and 3 vectors need to be stored: p” (overwriting p™~!), r™ (overwritten 
by r™+1) and y™*! (overwriting y”). Hence, restarting is not necessary. 


If we require a reduction of the residual by a factor ch? with h a measure of 
the mesh size, because the discretization error is O(h?), then Theorem 7.3.3 
tells us that a sufficient number of iterations is 


i 
m > -3 Vein ch? /2 . (7.29) 


For elliptic problems we generally have « = O(h~*). In d dimensions, the 
number of unknowns n = O(h~4). The work of one iteration with the conju- 
gate gradient method is O(n), so the total work satisfies 


W=O(n" Inn), a=1+1/d. (7.30) 


This is not better than what can be achieved by stationary iterative methods 
using an optimal overrelaxation parameter. But the CG method derives its 
usefulness from the fact, that œ can be brought down by preconditoning, 
or, equivalently, that the CG method can be used to accelerate stationary 
iterative methods, as first pointed out in Meijerink and van der Vorst (1977). 
No guessing of an optimal parameter will be required. 


Implementation of the preconditioned conjugate gradient method 


For application of CG, the preconditioned system must be SPD. Therefore 
(7.20) is replaced by 


Aj=b, A=CACT, g=C7T, G=C"Ty, b=Cb, (7.31) 


so that A is SPD if A is, assuming C to be nonsingular, of course. When CG 
is applied to (7.31), the matrix C can be eliminated by making the following 
substitutions: r = Cr, p = C-Tp. Deleting the tildes, this results in the 
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following algorithm (where M = C~!C~?);: 


Preconditioned CG method 
Choose y°; p`! = 0; r° = b— Ay®; p_1=1; m=1; 
while stopcriterion not satisfied do 
m= m+ l; 
Solve Mz=r™; pm = (2™, r”); Bm = pm/Pm-1;3 
pr =z + Bmp"; Om = (Ap™,p™); am = Pm[Om; 
yr — y” + Amp”, r 
Evaluate stopcriterion; 
end while 
It remainds to find a suitable M = C~'C-T with M`! approximating A7?. 
The stationary iterative method (7.2) can be rewritten as 


ytti = y + M-1(b+ Ny" — My") =y* + MT (b Ay"), 


from which it is clear that M71 can be regarded as an approximate inverse 
of A. Solving Mz = r” in the preconditioned CG method is equivalent to 
executing one iteration with the stationary iterative method, with starting 
iterate zero and right hand side r™. Because we must have M = C7!C7? 
for some matrix C, M must be SPD. We give two examples. 


Symmetric Gauss-Seidel 


The forward Gauss-Seidel method is defined by M = R+D, D = diag(A), R 
the lower triangular part of A excluding the main diagonal. Clearly, M is not 
SPD and cannot be used to precondition CG. For backward Gauss Seidel 
we have M = RT + D, and an SPD matrix M is obtained by symmetric 
Gauss-Seidel (forward followed by backward), given by 
yt? = yk 4+ My*(b— Ay"), Mi=R+D, 
yet} — yt + MIT (b — Ay* tt?) , 


from which it follows that 


yt} = yk +M'(b- Ay"), Mo) = My) + M7” — My" AM;* 
= Mī’ + Mī’ - Mī" (Mı + Mi - D)M;* = Mī" DM;". 


We have, with z = M7 ty, if y £0, 
yT Mty =z" Dz: >0, 


because d;; > 0, since A is SPD. Hence M is SPD, which means that there 
exists an SPD matrix C such that C7? = C~!C-T = M. Note that in the 
preconditioned CG method the matrix C is not needed. 
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Incomplete Cholesky decomposition 


This corresponds to the symmetric version of ILU: 
A=M-N, M=EL". 


We now simply choose C = £7! in (7.31). This approach was proposed in 
Meijerink and van der Vorst (1977), leading to the well-known ICCG (in- 
complete Cholesky conjugate gradients) method. ICCG was found to be very 
effective in numerical experiments described in Meijerink and van der Vorst 
(1977), Kershaw (1978); it was also found numerically that (7.30) holds in- 
deed. Subsequently, incomplete factorizations have found widespread use as 
preconditioners for Krylov subspace methods. The rate of convergence de- 
pends on the spectrum of CACT. Since M~!A = CT(CACT)C-T, M = 
C-1C-?T, MT1A has the same spectrum as CACT. Upper bounds for the 
required number of iterations can be obtained by means of Theorem 7.3.3, 
which asks for the condition number. Theoretical estimates of «(M71A) can 
be obtained only in special cases. For the standard 5-point discretization of 
the Laplacian on a uniform square grid and a special choice for L it was 
shown in Gustafsson (1978) that «(M~1A) = O(n'/2), so that for the two- 
dimensional case (7.29) gives m = O(n)/4Inn), leading to a total work of 


W=O(n* Inn), a= 5/4. (7.32) 


This value of a seems to be representative of what can be achieved in general 
with preconditioned Krylov subspace methods. If the eigenvalues of A are 
clustered, Theorem 7.3.3 is pessimistic. In Bruaset (1995) (Sect. 4.2) a survey 
of incomplete factorizations is given, and pictures of the spectra of A and 
M—'A are shown. 


The Bi-CGSTAB method 


A method that applies for general A, but with computing work and storage 
demands per step comparable to those of CG (hence, much less than for GCR, 
and GMRES) is the bi-conjugate gradient stabilized (Bi-CGSTAB) method, 
proposed in van der Vorst (1992). However, for a general matrix A there is 
no minimization principle like (7.16), and convergence is not guaranteed. Al- 
though less robust, this method is frequently significantly more efficient than 
GMRES, because it is cheaper per step, and the restarts that GMRES may 
require (because of storage limitations) slow down its rate of convergence. A 
cheap version (Bruaset (1995)) is obtained if postconditioning rather than 
preconditioning is used: ACy = b, y = C71y. The algorithm is given by: 
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Postconditioned Bi-CGSTAB method 


Choose y°; 
r’ =b- ay; pP =q = 0; a=w9 = 1; 
while stopcriterion not satisfied do 
p= a(r®, r™— 1) 1 {wm—a(r®, rm ayy 
p” = Càr] — Bum—19g™~*) + Bp? 4; q™ = Ap”; 
a = (r°, r™1)/(r°, g"); 
am = r™-1 _ ag™; s™ = C3™; t™ = As™; 
Wm = (t™, 5) / (t,t); 
y™ = y™1 +ap™ + Wms”; r 
Evaluate stopcriterion; 
end while 


m om 


— mm. 
= 8" — Wmi"; 


Final remarks 


A survey of Krylov subspace methods is given in Barrett et al. (1994). The 
choice of a good preconditioner seems to be more crucial for efficiency than 
the choice of a particular Krylov subspace method. Although this is not 
a theorem, equation (7.32) is representative of the computing work that is 
needed in the SPD case; in the general case, this will be more. An interesting 
comparison of various iterative methods is given in Botta et al. (1997). We 
will now discuss a class of methods that have the more optimal complexity 
W = O(n). The price to be paid is more complicated programming. 


7.4 Multigrid methods 


Here only a brief outline of multigrid methods will be sketched. Full introduc- 
tions are given in Hackbusch (1985), Wesseling (1992). Furthermore, the ele- 
mentary introduction Briggs (1987) and the pioneering paper Brandt (1977) 
can be recommended to those who want to get acquainted with the subject. 
The discovery of multigrid methods and (more general) multilevel methods 
(cf. Brandt (1977)) is the most significant development in numerical analysis 
in the last 25 years, and is having a strong impact on computational fluid 
dynamics. A review of the application of multigrid in computational fluid 
dynamics is given in Wesseling and Oosterlee (2000). The multigrid website, 
maintained by C.C. Douglas, is at www.mgnet.org. 


The multigrid principle 


Assume we have a discretization of a partial differential equation on the d- 
dimensional unit cube 2. Let the computational grid 2; be uniform with 
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stepsize h; in all directions; the role of the index l (denoting the so-called 
level) will become clear shortly. The problem to be solved on the grid §2; is 
given by 


Ay =b. (7.33) 


To begin with, a small number of iterations is performed with some stationary 
iterative method. Let this be done with, in pseudo-programming language, 
by a call S(A',b', y!) to a subroutine S, called the smoother. Here y! is both 
an input parameter (the initial iterate) and an output parameter (the ap- 
proximation to the solution of (7.33) after smoothing); the dual use of the 
symbol y' (solution of (7.33) and current iterate) should cause no confusion. 
As noted in Sect. 7.2, convergence is expected to be slow, with spectral ra- 
dius 1— O(h?). However, it frequently happens that the error is made smooth 
rapidly. This is called the smoothing property; for a precise mathematical def- 
inition see Hackbusch (1985) Sect. 6.1.3, Wesseling (1992) Sect. 6.5, and this 
property is assumed to hold here. 


The basic idea is now to approximate the error on a coarser grid §2;_1 with 
hi_1 = 2h,; the number of cells in §2; is assumed to be even in all directions. 
Fig. 7.1 gives a one-dimensional example. 


Prolongation and restriction 


Let U* : Qk — R be the set of grid functions on 2, and let there be given 
0 1 2 3 4 5 


0 l 2 
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Fig. 7.1. Fine and coarse grid 


a prolongation and restriction operator: 
P: UT’ U, Rou saul}, (7.34) 


For example, in one dimension we could have, using linear interpolation, with 
the numbering system of Fig. 7.1, 


~ — - l, — 
(P'y Haj = Y; t, (P'y' Haj+1 = A 14 y4) ' (7.35) 


and a natural restriction would be 


(Rty); = yhy. (7.36) 
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It turns out that (7.36) is not good. For information on the requirements 
that P’ and R'~! should satisfy, see Hackbusch (1985) (p. 149) or Wesseling 
(1992) (p.71). A good restriction is 


_ 1 
(rity) = z (Ya + 2y; + Yoj41) . 


Coarse grid correction 


After smoothing, the second step in the algorithm is approximation of the 
error on the coarse grid. The error in the current iterand y/, called e', satisfies 


Ale’ =b — Aly’. (7.37) 


Let A‘! be an approximation of A! on the coarse grid 2'71, obtained for 
example by discretization of the underlying partial differential equation on 
N!-1. Then we solve 


Ally} — pi-1 = R10 _ Aly) ; 


Next, an approximation to e' (also called e') is obtained by interpolation to 
R: 

e = ply’), (7.38) 
If, as assumed here, the original error (satisfying (7.37)) is smooth, then 
(7.38) may be expected to give a good approximation to the error. This can 
be formulated mathematically in a precise way, and is called the approzima- 
tion property (Hackbusch (1985) Sect. 6.1.3, Wesseling (1992) Sect. 6.5}. The 


reason why (7.36) is not good is that the approximation property does not 
hold, as it turns out. The coarse grid correction is added to y': 


y =y +e. (7.39) 
Finally, smoothing may be applied to y', and one two-grid iteration has been 
completed. 
Two-grid algorithm 


The foregoing leads us to the following iteration method. The coarse grid 
approximation to the error e'~! is called y'~?. 
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Two-grid method 
Initialize y'; 
do until convergence 
S(A‘, by’); 
ji-l = R'- 10! — Aly’); 
Solve A’-ty’-! = b-t, 
Iai i l-1. 
y =y +Py ; 
S(A', b, y); 
Check convergence criterion; 
end 


How to solve on §2;_1 is discussed below. 


Efficiency of multigrid method 


The nice thing about the two-grid method is that its rate of convergence 
is independent of hı, provided the smoothing and approximation properties 
hold. These requirements are not difficult to satisfy, as it turns out; for more 
information see Brandt (1977), Hackbusch (1985), Wesseling (1992). 


Let Ny be the number of unknowns on §2,. Then the work required for one 
two-grid iteration excluding the solve step on the coarse grid is cN;, with ¢ 
some constant. Denoting the work of the solve step by Wi_1, the work W, 
for one two-grid iteration is 


W, = cN, + Wii. (7.40) 


We obtain a true multigrid method by replacing the solve step by y two-grid 
iterations employing a still coarser grid 9-2, and so on recursively, till a very 
coarse grid {2 is reached, on which the cost of solving is negligible, and with 
No independent of N;. This means that if we refine §2;, the number of levels 
l is increased. Then we have, with d the dimension of §2, for the number of 
grid points on §2* 


Nz = 2°4No , (7.41) 
and the computing work satisfies the following recursion: 
Wi = c2¥4No +yWp-1, k= 1,2,...,0. (7.42) 


To obtain a mathematically elegant formulation we assume Wo = cNo. This 
may be incorrect, but this is of no consequence because Wo is very small. 
Then it follows that 


Wi = c2!4No(1 t+ y(274 + 9(2724 + + y27!4)...) 


cN(1+74+77 4.0.44), J= 7/2". (7.43) 


II 
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Assume 
y<l1. (7.44) 
Then 
W, <cN/(1— 7). (7.45) 


The following conclusions may be drawn from (7.45). The quantity eN; is 
the work performed on the finest grid. In practice the bulk of this work is 
in the smoothing, which consists of a very small number of iterations with 
a simple stationary iterative method. The work on the coarse grids adds to 
this 1007/(1 — ¥) percent. It turns out that usually y = 1 or y = 2 suffices 
to obtain a rate of convergence independent of hı. Taking y = 2, we get 


4 
W, < 2cN; (d= 2), Wi < zeM (d = 3). 


So the cost of a complete multigrid cycle is equivalent to the cost of just a 
few steps with the smoothing method. 


Because the rate of convergence is independent of h;, O(|ln hkj) = O (ln Ni) 
iterations are required to obtain an error reduction of O(h?). Since the work 
for one iteration is O(N;) the total computing work satisfies 


W = O(N, InN). (7.46) 


The logarithmic factor in (7.46) may be removed by changing the multigrid 
algorithm, namely by starting computations on the coarsest grid 2o (nested 
iteration or full multigrid), cf. Brandt (1980), Hackbusch (1985) Chap. 5, 
Wesseling (1992) Sect. 8.4, resulting in optimal computational complexity: 


W = O(N). 


This unique property has been proved rigorously under fairly general cir- 
cumstances, see Hackbusch (1985). There is no other method known which 
achieves this. Consequently, multigrid methods have a strong impact on com- 
putational fluid dynamics. 


Storage requirements 


The additional storage requirements for the computations on the coarse grids 
are quite modest. A reasonable estimate for the storage required om grid Ry 
is ¢N;, with c some constant which is the same on all grids. Total storage is 
therefore 


oN (14274 427744 00.) < oN 24/(27 — 1), 


so that compared to single grid solution only about 1/3 extra storage is 
required in two dimensions and 1/7 in three dimensions. 
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Nonlinear multigrid method 


A powerful property of multigrid is that it is directly applicable to nonlinear 
problems. Let the nonlinear discrete problem to be solved on {2; be denoted 
by 


Aly) =b. (7.47) 
The nonlinear two-grid algorithm is given by 


Nonlinear two-grid method 
Initialize y'; 
do until convergence 
(1) S(A‘, by’); 
9 pl — b! — Al(y’); 


(2) 
(3) Choose J71, s)-1; 
(4) bil = AIJ) + 5), Rtr: 
(5) Solve A’}(y'-!) = b7 t; 
(6) y = yf + (1/si-1)p'(¥* — #77); 
(7) S(A‘, By’); 
Check convergence criterion; 
end 


We proceed with a discussion of this algorithm. S represents some nonlinear 
smoothing iteration, for example nonlinear Gauss-Seidel. In (2) the residual 
is computed, which is going to steer the coarse grid correction. In this non- 
linear case we cannot use r’ as right-hand side for the equation for the error; 
Al(e') = r! might not even have a solution. For the same reason, R'~!r! 
cannot be the right hand side for the coarse grid problem on §2;_1. Instead, 
it is added in (4) to A'~1(y'-1), with g! sufficiently close to the solution 
in some sense; choosing j/—! = R'~'y' would be a good idea. Obviously, 
A'—t(yl-1) = A'-1(#-1) has a solution, and by choosing s;_; small enough 
in (4), (5) also has a solution. In (6) the effect of s;_1 is undone again, and 
y'—! — #-? is a coarse grid approximation to the error. It is easy to see that 
in the linear case the nonlinear two-grid algorithm is equivalent to the linear 
two-grid algorithm. Extension from two-grid to multigrid is obvious. 


Unlike the Krylov subspace methods discussed before, the nonlinear multigrid 
method does not need global linearization for application to nonlinear prob- 
lems; only local linearization in the smoother is required. This is an advan- 
tage, because computation of a global Jacobian may be expensive, and it may 
be ill-conditioned, for example for transonic flows. On the other hand, pro- 
gramming multigrid methods is more complicated than programming Krylov 
subspace methods. For comparison of Krylov subspace and multigrid meth- 
ods, see Wesseling (1992) Sect. 8.9 and references quoted there, and Zeng, 
Vuik, and Wesseling (1995). 
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Smoothing analysis 


Usually, the smoother is the most computing intensive and critical part of 
multigrid. In general, the smoother is a simple stationary iterative method. 
Therefore, like Krylov subspace methods, multigrid may be viewed as a way to 
accelerate the convergence of stationary iterative methods. If multigrid does 
not converge rapidly, this is mostly because the smoother does not reduce 
all non-smooth components of the error effectively. Apart from changing the 
smoother, this can often be remedied by accelerating multigrid by a Krylov 
subspace method, by using a multigrid iteration as preconditioner. This has 
been done for example in Kettler (1982), Braess (1986), Zeng, Vuik, and Wes- 
seling (1995), Washio and Oosterlee (1997). 


Not every convergent iteration method has the smoothing property. But by 
introducing damping every convergent stationary iteration method can be 
made into a smoother, as noted in Wittum (1989a), cf. Wesseling (1992), 
p.98. Therefore it is desirable that A’ is an M-matrix, because then conver- 
gent methods and hence also smoothing methods are easy to find. 


The smoothing property of stationary iterative methods can be investigated 
by Fourier smoothing analysis. To make Fourier analysis applicable, the co- 
efficients in the differential equation are made constant (by ‘freezing’ local 
values just as for Fourier stability analysis in Sect. 5.5), and the grid is taken 
uniform and the boundary conditions periodic. Then discretizations of partial 
differential equations can be written as 


Linh; = 5 AkYj+k = Fis j = (ji, vey Ja), k= (kis wey ka) ; 
kek 


with d the number of dimensions. A stationary iterative method based on 
splitting (Sect. 7.2) corresponds to a splitting A = Kı U Ko, and leads to the 
followig iteration scheme: 


>, akik =- > anvirk + Fj (7.48) 


KEK, ke K> 


The error € satisfies (7.48) with f; = 0. According to Theorem 5.5.1 we can 
write 
ey = 5 cme? . 
IEO 


Substitution in the homogeneous version of (7.48) gives, because the bound- 
ary conditions are periodic (cf. (5.20)): 


cp tt = g(B)ce, 9(0)=— > ane™®?/S- are. (7.49) 
Ko Kı 
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We have the smoothing property if |g(@)}| < 1 for all non-smooth wave- 
numbers @. It is clear that all |3| < 1 are smooth wave-numbers. Somewhat 
arbitrarily, the sets ©, and O, of smooth and rough wave-numbers are defined 
by 


O, = {0 € O: |0;|< 7/2, j=1,..,d}, 0, =O\O,. (7.50) 
The smoothing factor p is defined by 
p = max{|g(@)|: 0 € O,}. (7.51) 


We have the smoothing property if p < 1. The smoothing factor J is easily 
determined numerically or analytically; see Wesseling (1992) (Chap. 7) for 
many examples and discussion of the influence of boundary conditions. 


It turns out that in many cases ð is much smaller than 1. For example, 
for Laplace’s equation in two dimensions we have p = 1/2 for the Gauss- 
Seidel method and # = 0.13 for the 7-point ILU method. For a well-designed 
multigrid method the error reduction factor is often of about the same size 
as p, so that the rate of convergence is not only independent of the mesh 
size, but also fast. Chap. 9 of Wesseling (1992) and Wesseling and Oosterlee 
(2000) may be consulted for application of multigrid to computational fluid 
dynamics. 


7.5 Fast Poisson solvers 


Certain methods developed especially for the Poisson equation have come to 
be known as fast Poisson solvers. Because they are so efficient, this class of 
direct methods gets an honorary position in this chapter on iterative meth- 
ods. 


Solving Poisson’s equation for the pressure correction takes most of the time 
in computing nonstationary incompressible viscous flows on staggered grids, 
unless a fast Poisson solver is used. Without it large-eddy simulation of tur- 
bulence would not be feasible. Although fast Poisson solvers are restricted 
to orthogonal coordinates on rectangular domains, their importance for the 
applications just mentioned warrants a brief discussion here. 


By fast Poisson solvers one means solution methods based on the fast Fourier 
transform and/or cyclic reduction. A survey of these methods is given in 
Swarztrauber (1984). They will not be described here, but we will indicate 
when they can be applied. Let us have a rectangular computational grid and 
let (j,k) be grid point indices in the usual way; j, k € N. Cyclic reduction is 
applicable if the stencil of the discretization is of the following form: 
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€k 
a; bik Cj (7.52) 
dy, 


That is, the coefficients referring to horizontal neighbours may depend only 
on the horizontal grid point index j, and similar in other directions; cyclic 
reduction is applicable in more than two dimensions. The fast Fourier trans- 
form can be brought into play when in one or more directions the co- 
efficients are independent of the index in that direction; for example, if 
a; = da, bik = bk, cj SC. 


A stencil like (7.52) is possible only if the underlying partial differential equa- 
tion is separable. That is, the coefficient of a derivative ĝ™ /ĝz? is allowed to 
depend only on x,. Furthermore, no mixed derivatives may occur, which re- 
stricts us to orthogonal coordinate systems, so that general domains treated 
with boundary-fitted coordinates are out of bounds. The grid need not be 
uniform, but the step size in the horizontal direction may depend on j, and 
similarly for the other directions. We see that fast Poisson solvers are appli- 
cable to more general equations than the Poisson equation, but they were 
first developed for Poisson, and the name has stuck. 


Fast Poisson solvers are of interest because they are fast. The computing 
work W satisfies W = O(N ln N), or sometimes even O(N inln N), with 
reasonable constants of proportionality, and little storage is required. These 
methods are mature, and good computer codes are readily available on the 
Internet: math.nist.gov/cgi-bin/gams-serve/class/I2biaia.html. 


Because they are so efficient and convenient, effors have been made to some- 
how use fast Poisson solvers for more general problems (cf. Swarztrauber 
(1984)), such as problems on non-rectangular domains, or nonseparable equa- 
tions. But efficiency suffers, and we feel that if more generality and flexibil- 
ity are required, preconditioned Krylov subspace and multigrid methods are 
preferable. These have the potential to achieve similar efficiency in the non- 
separable case. 


7.6 Iterative methods for the incompressible 
Navier-Stokes equations 


The algebraic system to be solved 


We start with the discretization on a staggered grid in the stationary case. If 
we delete the time derivative in equation (6.42), the algebraic system to be 
solved may be denoted as 


(30) G)=(4)- (7.53) 
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This can be looked upon as a linear system arising in an iterative procedure 
to solve (6.42) with nonlinear N, but we can for the time being also retain 
the possibility that N is a nonlinear operator on u. The nonlinearity is due 
to the inertia terms. 


A crucial fact is that we do not have a K-matrix, nor an M-matrix (in the lin- 
ear case), even if upwind discretization is used for convection, because we have 
a zero block on the main diagonal in (7.53). This precludes straightforward 
application of the iterative methods discussed before, and special methods 
have to be used. The literature is vast, and we can give only a few illustrative 
examples. First we present a general strategy to deal with non-M-matrices. 


Distributive iteration 


Suppose we have a linear system 
Ay=b. (7.54) 
The system is postconditioned: 
ABy=b, y= By. (7.55) 


The postconditioning matrix B is chosen such that (7.55) is easier to solve 
iteratively than (7.54). For example, AB is an M-matrix, while A is not. To 
(7.55) the iterative methods discussed before can be applied in straightfor- 
ward fashion. For the iterative solution of (7.55) the splitting 


AB=M-T (7.56) 


is introduced, to which corresponds the following splitting of the original 
matrix A: 


A=MB~'-TB'. 
This leads to the following stationary iterative method for (7.54): 
MB71yk+1 =TB-lyk +b, 
or 
ytl = yë + BM *(b— Ay*). (7.57) 


Note that the iterative method defined by (7.57) is obviously consistent for 
any nonsingular B and M: if it converges, it solves Ay = 6. This means that 
when designing a distributive method, B and M in (7.57) need not be pre- 
cisely the same as in (7.56). For example, one could use a complicated B to 
design a good M for (7.56), possibly even with T = 0, and then change M a 
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little in (7.57) to make My = c easily solvable, and also B may be simplified 
to make (7.57) easy to apply. 


The method (7.57) is called distributive iteration. Equation (7.57) shows that 
the correction M~1+(b— Ay") corresponding to non-distributive (B = J) iter- 
ation is distributed, so to speak, over the elements of y*+1, whence the name 
of the method, coined in Brandt and Dinar (1979). 


Distributive iteration for Navier-Stokes 


A general treatment of distributive iteration is given in Wittum (1989b), 
Wittum (1990a), Wittum (1990c), Wittum (1990b), where it is shown that a 
number of well-known iteration methods for the incompressible Navier-Stokes 
equations can be interpreted as distributive methods. Examples will follow. 
The distributive iteration framework permits us to give a unified description. 


The system to be solved is (7.53). Define 


A=(56)- (7.58) 


A distribution matrix B such that AB is of block-triangular form: 


AB = ( z 3) (7.59) 


would be attractive. In the first place, splittings AB = M — N are easily 
obtained by splitting Q and S, leading to simple separate updates for velocity 
and pressure. Secondly, in case one wants to use multigrid, smoothing analysis 
is simplified; in Wittum (1990a) it is shown that if the splitting is regular, 
then we have the smoothing property if we have it for the splittings of Q 
and S, so that only uncoupled equations for ut, u? and p separately have to 
be analyzed. A possible choice for B is, with the same block structure as in 


(7.58): 
_f I Bry 
B= Ç Bog ) , 


_{N _ NBr + GB 
AB = (5 oe ) | 


resulting in 


Choosing B such that N Biz + GB22 = 0 results in the block triangular form 
(7.59). Therefore we choose 


By, = —N7'GB2 , 
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resulting in 


N 0 - 
AB = (> e) , C=-DN'GBy, (7.60) 


with B22 still to be chosen. The main difficulty in the original formulation 
(7.53), inamely the zero block on the main diagonal, has disappeared. If 
upwind discretization is applied to convection, then N is an M-matrix. If 
B22 is chosen such that C is also an M-matrix, chances are that AB is an 
M-matrix, making the system suited for iterative solution. Various methods 
result from the choice of B32. 


SIMPLE method 


A method widely known in the engineering literature as the SIMPLE method 
(‘Semi-Implicit Method for Pressure-Linked Equations’) is proposed in 
Patankar and Spalding (1972) and discussed in detail in Patankar (1980). This 
is perhaps the oldest and most widely used iterative method for the station- 
ary Navier-Stokes equations. Many variations and improvements have been 
proposed, such as SIMPLER (Patankar (1980)), SIMPLEC (Van Doormaal 
and Raithby (1984)) and PISO (Issa (1986)). Without in any way disparaging 
the creativity that has gone into the design of these methods, nor denying 
the progress they have brought, it must be said that their formulation in 
the engineering literature is often ad-hoc and obscure; even their consistency 
with the system to be solved is not easy to verify. The distributive itera- 
tion framework, however, results in a unified and transparent formulation. 
To write some variants in a distributive formulation, not only postcondition- 
ing but also preconditioning must be applied. 


The SIMPLE method is obtained by choosing 
Bz = 1, 


so that (7.60) becomes 


N 0 
AB = (> _py-ig ) 
A splitting AB = M — T is defined by 
_{Q 0 
M = (5 R)’ 
where Q and R are approximations to N and —DNT!G such that 


My = b is easily solvable. For the distribution step in (7.57) B is approxi- 
mated by 
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I -NG 
B= (a rS): 

where N~! is an easy to evaluate approximate inverse of N (in the nonlinear 
case one may think of N~1(f) as giving an approximate solution of N(u) = f 
by some iterative process}. Depending on the choice of N~, Q and R, various 
variants of the SIMPLE method are obtained. N just represents a set of 
convection-diffusion equations and it is easy to think of simple stationary 
iterative methods (linearizing globally or locally) to use, thus determining 
Q. In the original SIMPLE method, one chooses N = diag(N). This makes 
DN-'G easy to determine. It has a simple 5-point stencil in two or 7-point 
stencil in three dimensions, and resembles the standard discrete Laplacian; it 
is a K-operator, and some simple stationary iterative method may be applied, 
thus determining R. 


This results in the following algorithm. Its basic form is given by (7.57). In 
the present case we have 


(EDG om 


After computing the residuals rf, rf, a preliminary velocity correction ĝu is 
computed by solving 


k 
Qôu =r]. 
Next, a preliminary pressure correction is computed by solving 
Rôp = r$ — Dôu . 


In the distribution step new corrections are obtained by 


OMOR Graa] 


Finally, in the SIMPLE method underrelaxation is applied with parameters 
wu and wp: 


k+1 k 


uth = uf 4 wuu, pët! = p* 4+ udp. (7.62) 


Robustness and efficiency properties of this type of methods are similar to 
those of the stationary iterative method implied by Q for the convection- 
diffusion model problem (7.12), combined with those of the stationary itera- 
tive method implied by R for the rotated anisotropic diffusion model problem 
(7.13). This means that convergence will slow down upon grid refinement. If 
there are grid cells with high aspect ratio, or if there are significant mixed 
derivatives due to the use of strongly non-orthogonal coordinates, or if con- 
vection dominates, one of the robust methods listed in Sect. 7.2 should be 
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used. For high Reynolds numbers first order upwind needs to be used for con- 
vection in order to make N an M-matrix (roughly speaking), so that defect 
correction will be needed for better accuracy. The introduction of the relax- 
ation parameters in (7.62), or using variants of SIMPLE such as SIMPLEC 
or PISO, do not give essential improvement of efficiency and robustness; see 
Van Doormaal and Raithby (1984) for a comparison. Like stationary itera- 
tive methods, this kind of method asks for acceleration by Krylov subspace 
or multigrid methods. Multigrid acceleration has been investigated, with fa- 
vorable outcome. The smoothing factor of SIMPLE has been investigated in 
Shaw and Sivaloganathan (1988). Theory concerning the approximation and 
smoothing properties of distributive iterative methods for the Navier-Stokes 
equations is presented in Braess and Sarazin (1996) and Wittum (1990a). 


Distributive Gauss-Seidel method 


This method is proposed in Brandt and Dinar (1979) and put in distributive 
iteration formulation in Hackbusch (1985). It was designed right from the 
start as a smoothing method to be used with multigrid. This method does 
not bring AB exactly in the block triangular form (7.59). Choosing 


I -G 
a=(f Da) 


N GDG—NG 
AB = (5 -DG ) . (7.63) 


we obtain 


The splitting AB = M — N corresponds to the Gaus-Seidel method. The 
following algorithm results from (7.57). Residuals rf, r are computed ac- 
cording to (7.61). Next, one Gauss-Seidel iteration is applied to 


Nóu = rk 


with initial guess du = 0. This may be a nonlinear Gauss-Seidel iteration; 
previous linearization of N is not necessary. Then one Gauss-Seidel iteration 
with zero initial guess is applied to 


DGôp = Déu. 


Then the distribution step is carried out: 


(i) = (0 be) (5p) 


and finally 
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u®tl ut 4 bu, pti =p 4 bp. 


The following difficulties may arise. The Gauss-Seidel splitting of AB is de- 
pendable only if AB is an M-matrix. This requires using upwind discretization 
for convection for high Reynolds numbers, making N an M-matrix (roughly 
speaking). Furthermore, GDG — NG must be small, so that M corresponding 
to Gauss-Seidel is a sufficiently accurate approximation of AB. For the Stokes 
equations (Re = 1, no convection) it may be shown that GDG — NG = 0 in 
the interior, but not at the boundaries (Hackbusch (1985), Wittum (1989b)). 
For high Reynolds numbers, however, GDG — NG need not be small. For 
the Stokes equation a good smoothing factor (the same as for Gauss-Seidel 
applied to the Poisson equation) is found in Brandt and Dinar (1979), and 
in Wittum (1989b) numerical experiments show satisfactory multigrid con- 
vergence. But for high Reynolds numbers the method as it stands behaves 
less satisfactorily. Roughly speaking, for Navier-Stokes the method behaves 
very much like the Gauss-Seidel method for the convection-diffusion equa- 
tion. The measures that can be taken to improve robustness in the latter 
case (cf. Wesseling (1992), Chapter 7) can be extended to the former case. 
The following method provides an example. 


Distributive ILU method 


This method is introduced in Wittum (1989b). The distribution matrix B is 
the same as in the preceding method, but M is derived from incomplete LU 
factorization: 


AB=LU—-N. 


For an introduction to ILU and references to the literature, see Wesseling 
(1992) Chap. 4. Just as for the convection-diffusion equation, ILU gives bet- 
ter efficiency and robustness than the Gauss-Seidel variant disussed before. 
Furthermore, the block GDG — NG in AB in (7.63) is taken better care of, 
and satisfactory multigrid convergence is obtained for high Reynolds number 
flows (Wittum (1989b), Wittum (1990c), Wittum (1990b)). 


A distributive ILU method based on a SIMPLE type distribution matrix is 
proposed in Zeng and Wesseling (1995). We choose 


I —-N-'G 
B= E CI ) , (7.64) 


where the parameter Ç is introduced to enhance multigrid convergence. This 


gives 


aB= (5 Gne) 
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In order to construct a suitable splitting of AB, N`} is replaced by 
N-21, N =diag(N). This changes AB in AB. With ILU one chooses 


M=(L4+D)D"\(D+U), 


with Z and U strictly lower and upper triangular, respectively, and D diag- 
onal. L, D and U are determined from 


Mu = (AB), (k,1)EG, Mu =0, (k,l) gG, 
where the index set G is choosen as follows: 
G = {(k,k), (k,k +1), (k,k+K +1), (k,k +K FD}, 


where K is the number of cells in the horizontal direction. ILU depends on 


the ordering of the unknowns. In Zeng and Wesseling (1995) ule» uj_,, and 
pj are put behind each other, and j increases lexicographically. The iteration 


method becomes, according to (7.57): 
yt) = yf + wBM—1(b— Ay"), 


where B is a simplification of B in (7.64): 


B= (G -Ñ e) 
0 çI 
and w is another relaxation factor. Numerical experiments with multigrid 
described in Zeng and Wesseling (1995) show that fixed values of w = 0.7 
and Ç = 2 result in rapid multigrid convergence for several flow problems in 
non-rectangular two- and three-dimensional domains, for medium and large 


Reynolds numbers. Galerkin coarse grid approximation is used as described 
in Zeng and Wesseling (1994). 


Symmetric coupled Gauss-Seidel method 


The SCGS method is proposed in Vanka (1986a) as a smoother to be used in 
multigrid. It is not of the distributive type (7.57). Each cell is visited in turn in 
some prescribed order, in Gauss-Seidel fashion. The five unknowns associated 
with a cell 2; (in the notation of Sect. 6.4), namely u},.,, ute, and pj, are 
updated simultaneously, while keeping the other unknowns fixed. Note that 
the velocity variables belong to two cells; for example, uje, belongs to 22; 
and §2;_2¢,. Hence, they are updated twice in a sweep over the domain. The 
convection terms are linearized. The system for the corrections to the five 
variables associated with 92; looks like 
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a @i2 «413° «44? fule, 

a21 a22 a23 a34 —h7} tipe 

&31 392 a33 a34 hz" bus e, =r. 
a41 G42 043 d — hy! Su? te, 

hy’ —hy* hy* —hy* 0 bp; 


This system is approximated by dropping the off-diagonal agı, and damping 
is introduced by dividing the diagonal elements by damping factors, that 
hopefully can be fixed at problem-independent values. The result is 


a11/01 0 0 0 hy? uie, 
0 422/01 0 0 —hy? Suite, 
0 0 a33/02 0 hz" bud, =r. (7.65) 
0 0 0  as4/02 —h3' ÔU? je, 
hy’ =h} hy! -hy’ 0 Op; 


This system can be written in the following partioned form: 


(a 0) (= Gs) 

A? 0 Uz} \ be}? 

where U; contains the du unknowns and U2 = dp;, and its solution is given 
by 


Uz = (AZ Ay 'bi — b2)/AZ Ay Ag, Uy = AT} (b1 — AU). 


Since A, is a diagonal matrix, this is easily evaluated. The procedure ex- 
tends effortlessly to the three-dimensional case; Vanka (1986b) gives a three- 
dimensional application. 


The method is called symmetric because the four velocity unknowns associ- 
ated with a cell are treated the same way; it is called coupled because the 
velocity and pressure unknowns are updated simultaneously, and it is called 
a Gauss-Seidel method because the cells are visited sequentially in Gauss- 
Seidel fashion. 


The method has been applied only in conjuction with multigrid. Suitable val- 
ues for the relaxation factors c1, and o2 must be determined empirically. Usu- 
ally, one can take o1 = o2, and optimum values are found to range between 
0.3 and 0.8 (Vanka (1986a), Vanka (1986b), Oosterlee and Wesseling (1992b), 
Oosterlee and Wesseling (1992a), decreasing with increasing Reynolds num- 
ber. In Wittum (1990c) it is found that with c1 = e} = 1 SCGS is still an 
acceptable smoother. Fourier smoothing analysis of SCGS and numerical ex- 
periments are presented in Sivaloganathan, Shaw, Shah, and Mayers (1988). 
Sivaloganathan (1991), who find SCGS to be a more efficient smoother than 
SIMPLE, whereas in Wittum (1990c) distributive ILU is found to be a bit 
more efficient than SCGS. The method works for high Reynolds numbers. For 
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robustness with respect to large cell aspect ratios it is necessary to update 
rows and columns of cells simultaneously, as is to be expected from the analy- 
sis of the model problems (7.12) and (7.13) in Wesseling (1992) Chap. 7. This 
can be kept relatively simple and cheap, although the systems to be solved 
are more complicated than (7.65). The method is described and applied in 
Thompson and Ferziger (1989), Oosterlee and Wesseling (1992c), Oosterlee 
and Wesseling (1993), Oosterlee and Wesseling (1994), and its robustness and 
efficiency are found to be satisfactory. 


The colocated case 


We will now discuss the case of spatial discretization on a colocated grid with 
the pressure-weighted interpolation method of Rhie and Chow (1983). The 
system to be solved may be written as, deleting the time derivative in (6.31): 


De)G)=Q). 


where B in (6.31) is renamed C', reserving the symbol B for the distribution 
matrix. Distributive methods are called for here as well, although now there 
is no zero block on the main diagonal. But C is not a K-matrix, as is easily 
seen from (6.26). So, not surprisingly, iterative methods proposed in the lit- 
erature are usually inspired by SIMPLE, and of distributive type, although 
the distributive formulation is not used to describe the methods. 


The iterative method proposed in Rhie and Chow (1983) can be formulated 
as follows. The distribution matrix is of SIMPLE type, giving 


AB = NG I-N'G\_(N 0 
“\DCJ\O I TID C-DN?'G)° 
A splitting AB = M — N is defined by choosing 
_{Q 0 
M = (5 R)’ 
where Q is an approximation to N corresponding to some iterative method 


for the momentum equations without the pressure, and R is a similar ap- 
proximation to -DN~'!G with N = diag(N). In the distribution step B is 


chosen as 
_ [1 -ÑG 
B= (5 A70) 


With (7.57) the following iterative method results. Compute: 


CH=) - (be) Gr) 
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Solve: 
Qsu=ri, Réip=rs ~ Dôu. 
Distribution step: 
ĝu := ĝu — N` tGôp . 


Finally, 


k 


k+l uF 4 du, p 


u H = pë + wydp. 

Of course, concerning robustness and efficiency the same remarks can be made 
as for the SIMPLE method on staggered grids; these properties depend on 
the choice of Q and R. In Rhie and Chow (1983) turbulent flow calculations 
on boundary-fitted grids around airfoils are reported to converge in from 500 
to 1000 iterations. See Acharya and Moukalled (1989), Kobayashi and Pereira 
(1991), Miller and Schmidt (1988) for variations on the SIMPLE theme in 
the colocated case. Multigrid acceleration is described in Barcus, Perić, and 
Scheuerer (1988), Lien and Leschziner (1994), Moukalled and Acharya (1991), 
Zang, Street, and Koseff (1994). 


The pressure-correction method 


For nonstationary problems the pressure-correction method is efficient. The 
resulting algebraic systems are easier to solve than before, because the 
pressure-correction method uncouples velocity and pressure. With explicit 
schemes, the velocity update requires little effort, and computing the pres- 
sure correction is the computational bottleneck. Where applicable, a fast 
Poisson solver as described in Sect. 7.5 should be used. When this is not the 
case, as for instance in non-orthogonal coordinate systems, a Krylov subspace 
or multigrid method should be used. Efficiency and robustness are as for the 
anisotropic diffusion model problem (7.12). Application of preconditioned 
GMRES is described in Vuik (1993). For implicit schemes, systems have to 
be solved for the velocity components as well. How this can be done with 
preconditioned GMRES is described in Vuik (1993), Vuik (1996). Efficiency 
and robustness are as for the convection-diffusion model problem (7.12), dis- 
cussed in Sect. 7.2. A comparison of Krylov subspace and multigrid methods 
and combination of the two (Krylov subspace preconditioned with multigrid) 
is presented in Zeng, Vuik, and Wesseling (1995). 


8. The shallow-water equations 


8.1 Introduction 


We now enter the realm of hyperbolic equations. Instead of starting with an 
introduction to numerical methods for hyperbolic equations in general, we 
prefer to begin by plunging into the particular case of the shallow-water equa- 
tions. Other hyperbolic equations will be considered in Chapters 9 and 10. 


The derivation of the shallow-water equations from the Navier-Stokes equa- 
tions is given in Sect. 1.16. The resulting equations (1.121) and (1.124) govern 
to a good approximation flows in channels, lakes, rivers and seas, under the 
shallowness assumptions mentioned in Sect. 1.16. 


The core of mathematical models for flows on a planetary scale in the at- 
mosphere and in the oceans consists of the shallow-water equations, but the 
computing methods used are specialized, and a sufficient introduction would 
exceed the confines of this book. We will therefore restrict ourselves mainly 
to applications in hydraulics, and not go into geophysical fluid dynamics. 


More extensive introductions to computing methods for the shallow-water 
equations are given in Abbott (1979), Kowalik and Murty (1993), Vreugden- 
hil (1994). 


8.2 The one-dimensional case 
Governing equations 


In one dimension the shallow-water equations (1.121) and (1.124) reduce to 





at Ar 0, (8.1) 
ou | ,,dU | Od+H) . gUlU| _ 
at an to Oe + Gg 7"? (8.2) 


where U is the depth-averaged flow velocity, d the water depth, g the gravi- 
tational acceleration, H the height of the bottom, and C the Chézy friction 
coefficient. 
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Classification 
Some basic mathematical properties of first order systems like (8.1) and (8.2) 


have been discussed in Chap. 2. The general form (2.15) becomes in the 
present case 


“4 F =Q, (8.3) 


d U d 0 
w=(7): r=(( i) o=(_ja ja) 


According to Sect. 2.2, the type of (8.3) depends on the properties of the 
following generalized eigenproblem: 


with 


a 


(nol +niF)W =0. 
We have 

det(nof +nıF)=0 (8.4) 
if 


GET) ea 


Since we have found two real linearly independent vectors (no, nı), the sys- 
tem (8.3) is hyperbolic, according to Definition 2.2.2. 


Eigenvectors corresponding to the plus and minus signs in (8.5) are, respec- 
tively, 


W = (7°) , (8.6) 


g 


Characteristics 


According to Sect. 2.2 there exist in the frozen coefficient case wave-like 
solutions W = We'”(+-*) with 


Since 
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we see that w = constant along trajectories t = t(s), z = x(s) with 


dt dz 
— = — =U Fe. 8.7 
ds "ds + (8.7) 
Manifolds on which w(t, x) = constant were called characteristic surfaces in 
Sect. 2.2. In the present case these surfaces reduce to curves and are called 
characteristics. They are defined by (8.7), also in the nonlinear case. 


Diagonalization 


Let us introduce new unknowns Z such that W = W(Z) and let B denote 
the Jacobian matrix 


B = {dW/0Z} . 
Then equation (8.3) becomes 
ðZ an pô gy 
a +B FB =B Q. (8.8) 


It would be nice to choose Z such that B71FB is a diagonal matrix, because 
then the left-hand side of (8.8) decouples into independent single equations. 
This can be achieved as follows. The eigenvalues and eigenvectors of F have 
already been determined (see (8.4)-(8.6)). Choosing the columns of B equal 
to the eigenvectors of F we get 


_ fe -e -1_1f lfe Ig -inpp. {Ute 0 
p=(‘ ADE B = 3 ( th. 1/g }? BUFB= 0 U-ec)}° 


The new unknowns Z satisfy 





Oz; _ -1\. 
Ow; — (B dij 1 
which can be integrated to give 
l 1 
zı = z(e + U/2), z} = g E + UP?) :. (8.9) 
Equation (8.8) becomes 
dey ða pig, 9% ZE 
ga Utg TE Qo g U h)g TE Q. (8.10) 
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Linearized equations 


For future reference, we formulate a set of linear equations that approxi- 
mate (8.1) and (8.2) in the case of small perturbations. Let H = constant, 
d=d+d , d = constant, U = U + u, U = constant. To keep things simple, 
the friction term is replaced by ru. Deleting primes, equations (8.1) and (8.2) 
become 

Od -ðu -ðőðd 

— 4d— = 

att “Be + Ge =: 

ðu ðu Od 


This can be rewritten as 
Ow Ow 


ry + ae t W=0, (8.11) 


v(i) r= (FE) e= (88) 


Instead of linearizing around a constant velocity U we can also assume that 
U is small, or more precisely 


where 


Fr=V/vgd <1, 


where V is a measure of the magnitude of |U|, and Fr is the Froude number. 
Neglecting terms involving U? and Ud , equations (8.1) and (8.2) again go 
over in (8.11), but with 


w=(r) r= (93) e=). (8.12) 


If we neglect friction, this gives us the simplest approximation of shallow 
water flow. 


Waves 


Various kinds of waves may be distinguished; for a survey see Vreugdenhil 


(1994). 


Assume constant bottom height H and neglect the friction term, so that 
(8.10) becomes 


Oz Oz, _ Oz Oz _ 
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Hence zı = constant along the characteristic dt/ds = 1, dz/ds = U +, 
and z2 = constant along the characteristic dt/ds = 1, dr/ds = U — c. The 
quantities z1 and zz are called the Riemann invariants. We can look upon 
zı and zz as signals that travel with a speed U +c and U —e, respectively. 
Superimposed on the mean flow is a signal velocity +e that is caused by 
gravity only. We have wave-like solutions of a type called gravity wave. The 
velocity c = gd is called the celerity. The Froude number is Fr = U/e. 


Next, consider the linearized equation (8.11). Assume a harmonic wave of the 
form 


W = Well*t-“4) W = constant . (8.14) 


This is a solution of (8.11) if 


a 


(-iwl+7kF+G)W=0. 
There are nontrivial solutions if the determinant is zero: 
(ikU — iw)(GkU —iw+r)+h?e? =0. (8.15) 


This gives a functional dependence w = w(k). This is called the dispersion 
relation. As will be clear from equation (8.14), a non-zero imaginary part 
Im(w) corresponds to damping or amplification, whereas the real part Re(w) 
corresponds to propagation. The phase and group velocities are defined by 


Cp = Re(w)/k, cg = Re(dw/dk) . (8.16) 


If the phase velocity of a harmonic wave depends on the wavenumber k, the 
wave is called dispersive. A dispersive wave, if it is composed of different har- 
monics, deforms while traveling. Energy travels with the group velocity. For 
a nondispersive wave, Cg = Cp. 


Neglecting bottom friction (r = 0) in (8.15), we obtain the following disper- 
sion relation: 


w= k(U c), 
so that 
Cp =Cg=Ute. 


These waves are nondispersive, and are the gravity waves discussed before. 


The celerity c is seen to be the phase velocity in still water (U = 0). Because 
w is real, there is no damping. 


Let us now include bottom friction. Solving (8.15) for w gives the following 
dispersion relation: 
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w = kU —ir/2+ Ske? — 12/4, (8.17) 
If r > 2ke then Re(w) = kU, and we have a nondispersive wave with 
Cp = Cg =U. 


There is damping at a rate 


exp{-5r + fr27/4—ket}t . (8.18) 


Waves generated in rivers by heavy rainfall (called flood waves) have very 
long wavelength (i.e. k < 1). In this case 


Vr? /4— k??? = 5( — 2k? /r?) 


Taking the plus sign in (8.18), corresponding to the most weakly damped 
wave, we see that flood waves are damped at a rate 


exp(—tk?c?/r) . 
If r < 2ke then (8.17) shows that 
Re(w) = kU + y k?e? — r2 j4 
so that 


Cp =U + Ve? — 2 /4k?, cg =U 4e/ V1 — 72 /4k2e? . 


These waves are dispersive. As r | 0 the pure gravity waves are recovered. 


Boundary and initial] conditions 


Boundary and initial conditions that lead to a well-posed problem for equa- 
tions (8.1) and (8.2) are (obviously) precisely those which either explicitly or 
implicitly uniquely determine the Riemann invariants 21,2 in equation (8.13). 
Such conditions may be determined by considering the geometry of the char- 
acteristics, Let the domain be z € (0, L), t € (0, T). Figure 8.1 gives a sketch 
of the two families of characteristics. The Riemann invariant z, is constant 
along characteristics inclining to the right, for example AC, whereas z> is 
constant along the other family of characteristics, for example BC. Suppose 
that d and U are given at the segment AB. Then, according to (8.9), z1,2 
are known at AB. Along their respective characteristics Z1,2 are constant. 
Consequently, in all of the shaded region ABC both zı and z, and hence 
also the solution (d,U), are known and uniquely determined by the data on 
AB. It follows that changes in d and/or U along AB (or anywhere else in 
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Fig. 8.1. Geometric structure of characteristics of equations (8.1, 8.2). 


ABC) can influence the solution in C, but no influence is exerted in C by 
changes anywhere else. Therefore the shaded region ABC is called the do- 
main of dependence of the point C. Furthermore, the data on AB propagate 
their influence along the characteristics emanating from AB. The domain 
swept out by these characteristics is called the domain of influence of AB. In 
the present case, this is the region ADTEFB. 


Next, suppose (d,U) is given at t = 0. Then the solution is determined in 
OGL. In order to determine the solution outside OGL, boundary conditions 
are required at r = 0 and z = L fort > 0. At z = 0, t > 0, 22 is already 
determined by the data at t = 0. In order to determine the solution at z = 0 
one additional quantity must be given. The general rule is, obviously: 


The number of boundary conditions in a point on the boundary 
must be equal to the number of incoming characteristics at that 
point. 


Therefore, at x = 0 a condition independent of z3 must be given, such that 
zı is determined. For example, suppose that d(t, 0) is given. Hence, we know 
c(t,0) = y gd(t,0). Then z1(t,0) = Z(t, 0) + z2(t, 0) is known, and the so- 
lution is now also uniquely determined in the region OGHT. It is left to the 
reader to show that giving one suitable quantity at (t, L) determines the so- 
lution in the remainder of the domain. No condition is to be given along TE, 
because both characteristics are outgoing there. 
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Discretization 


We start with the simplest approximation to shallow water flow, namely 
equations (8.12) with G = 0. In order to obtain second order spatial accu- 
racy with a compact stencil that excludes spurious solutions (‘wiggles’) and 
facilitates accurate implementation of boundary conditions, the staggered 
grid of Fig. 8.2 is used. The use of a staggered grid was already proposed in 
1922 in the pioneering work of Richardson (1922). Staggered grids of this kind 


x=0 d U x=L 


0 1/2 1 3⁄2 2 J-1/2 J 
Fig. 8.2. Staggered grid for the shallow-water equations. 


are commonly used for numerical solution of the shallow-water equations. 


Let the mesh size be h = constant. We approximate d(t, x) in z; = jh and 
U(t, 2) in 2341/2 = (j + 1/2)h. We discretize (8.12) by finite differences in 
space in the following simple and natural way (postponing time discretiza- 
tion): 


dd;  -U;41/2 —U;- 

Li pB I _ 9 j= J1, (8.19) 
dt h 

dU; dj41— d; 

Siti? 4 gint 9 j=0,1,..,J—-1. (8.20) 


The system has to be completed with appropriate boundary conditions. Ac- 
cording to the rule given before, one boundary condition should be available 
at z = 0, and one at z = L. Consider x = 0. Implementation of a condition 
do = given is trivial. If instead U(t, 0) is given it is natural to redefine the grid 
such that 21/2 = 0, and again implementation is trivial. This is an advantage 
of the staggered grid. If instead one wants to stick to the grid of Fig. 8.2, 
then one may extend (8.19) to j = 0, and eliminate the virtual value U_1s2 
by means of 


U_ij2 = 2U (t, 0) — Uy /2 . 
The boundary z = L is handled similarly. 


In order to develop some proficiency in stability analysis for the shallow-water 
equations, we will present a number of simple cases. 
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Explicit Euler scheme 


By replacing the time derivatives in (8.19), (8.20) by a forward difference we 
obtain the following explicit scheme: 


dttd U7, — U" 

j iy qitu? 7 j—-1/2 _ 0, (8.21) 
n+1 n n n 
j+1/2 ~ UF 41/2 n diyi d; 

h Ih 


where the superscript denotes the time level. In order to study von Neumann 


stability, we postulate 
d” d" , 
j — 2 146 
(a) =C) (8.23) 


and find upon substitution in (8.21) and (8.22) (assuming the grid to be 


infinite) 
dnt d” 
(fons )=G( Gn) (8.24) 


with the amplification matrix G given by 


dr 
1 -T 2isin 6/2 
G=| 4, psn b/2 | (8.25) 


= 0, (8.22) 


The eigenvalues of G are 
A1,2 = 14 2io| sin @/2| , 
where 
o=er/h 


is called the CFL number, see Sect 5.6. Since there exist values of @ (in fact 
all 8 # 0, 27) for which |A;,2] > 1, this scheme is instable. 


Implicit Euler scheme 


This scheme is given by 


n+l n n+1 n+l 
de =d} Ui ja Ujin 
a dE a, 


n+l +1 1 
Uiti/2 — UF 172 4 git -d;* 


- 7 = 0. 
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It is left to the reader to verify, that the amplification matrix is given by 


—1 


dr 
—?2isinð/2 
G=| yr l p 5in0, (8.26) 
~— 2i sin 6/2 1 
h 
with eigenvalues 
à1,2 = (1+ 2io sin 0/2)7} . 
Hence 
lAl = (1 + 40? sin? 0/2) 71? <1, 
so that the scheme is unconditionally stable. 
Leapfrog scheme 
The leapfrog scheme (first used in Richardson (1922)) is given by 
1 n= n n 
dit -dj Un- U- _ 
a tie, 
T (8.27) 


n+l n-i 
Vira iriz 4 _ 
2r h 
The temporal accuracy of this scheme is O(r?), which is better than the 
Euler schemes. Because the scheme is explicit, the computing cost is low. 


A straightforward way to investigate the stability of (8.27) is to substitute 
(8.23) and (8.24), which gives the following equation for the amplification 
matrix G: 


G?-I+AG=0, (8.28) 


with 


0a Td , Tg 
A= = — = 1— sj . 
(5 0 ). a = 41 7 sin6/2, B= 4i i sin 6/2 


It is a tedious affair to determine G from (8.28). However, since we do not need 
G, but only its eigenvalues, we can proceed as follows. If V is an eigenvector 
of G with corresponding eigenvalue A, then 


(A? -1)V+AAV =0, 


so that V is also an eigenvector of A; let the corresponding eigenvalue be p. 
Then the eigenvalues of G are related to those of A by 
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M-14+prA(=0. (8.29) 
We have 
H1,2 = +4io| sin 6/2| . 


This gives us four eigenvalues A (taking all possible permutations of the signs): 


A1,2,3,4 = £2io| sin 0/2] + 4/1 — 40? sin? 6/2. 


The reason why we encounter four eigenvalues is that (8.28) has two solutions 
for G, each with two eigenvalues. If 


20|sin0/2| < 1 (8.30) 
then 
|A1,2,3,4{ = 1, (8.31) 


else there are A’s with |A| > 1. Hence for stability we must have (8.30) for all 
8, hence 


o<1/2, (8.32) 
so that the leapfrog scheme is conditionally stable, with stability condition 


h 


TS Je’ 


(8.33) 


As it turns out, the pioneering calculations in Richardson (1922) violate this 
stability condition, so that wrong results were produced. L.F. Richardson was 
ahead of his time. Stability theory for numerical methods for initial value 
problems starts with the celebrated paper of Courant, Friedrichs, and Lewy 
(1928). 


A somewhat different analysis of the stability of the leapfrog scheme is left 
to the reader in Exercise 8.2.4. 


Equation (8.31) shows that the scheme does not contain any damping. As a 
consequence, perturbations in initial and boundary data persist, which may 
degrade the results. Inclusion of the bottom friction term has a damping ef- 
fect, cf. Exercise 8.2.5. 


Because it involves three time levels, the leapfrog scheme needs an extra ini- 
tial condition compared to the differential equation. There are two linearly 
independent solutions, one of them essential, the other spurious. It depends 
on the implementation of the initial conditions how much the spurious solu- 
tion makes itself felt. Because of (8.31) the spurious solution is not damped. 
Therefore in practice the next scheme is usually preferred. 
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Hansen scheme 


This scheme has been popular for solving the shallow-water equations since 
the paper of Hansen (1956). It is given by 


141 _ gn yetl/2 _ path /2 
i 


j qit! ; rue gg, (8.34) 
T 
pnt3/2 _ n+1/2 qrti _ det! 
j+1/2 j+1/2 +g 7 j —0. (8.35) 
T 


Note that we now have also staggering in time. Because d”+! can be evaluated 
before U”+3/2, this scheme is effectively explicit. It shares with the leap- 
frog scheme the advantages of O(r?) accuracy and explicitness, but it does 
not have spurious solutions, because no initial conditions additional to those 
given for the differentia] equations are required. Also it requires less storage, 
because the solution needs to be stored at only two time levels. 


Implementation of initial conditions is easy. Let d? and U? be prescribed. 
We introduce the virtual values U~'/?, use (8.35) for n = —1 and eliminate 
U-'2 with 
p72 — ggz? _ prise , 
so that U1/? is found from 
1/2 _ gT 
Uii = Uja + 5p (G41 — 4) - 


A straightforward way to investigate the stability of (8.34), (8.35) is to sub- 


stitute 
d”? d" . 
J — . ijo 
En ) E (Goss 


quti d” 
( "n +3/2 ) =G ( Ọ”+i/2 ) ' 


with the amplification matrix G given by 


which gives 


G = G7 'G1, 


with 
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The eigenvalue problem for G is 


(Gz'Gi -ADV =0, 


or 
(Gi —AG2)V =0. 
From 
det(G1 — AG2) = 0 
we find 
M2 =l+ pt Sp? + 2p, p= 0° (cos0 -— 1). 
For 
—2< p< 0 (8.36) 
we find 
lAi2|=1. 


For p < —2 we have |Aq| > 1 for certain values of 6, so that for stability it is 
necessary and sufficient that (8.36) holds for all 6. Hence we must have 


o<l, (8.37) 
so that the Hansen scheme is conditionally stable, with stability condition 


r<h/e. (8.38) 


Sielecki scheme 


This is another well-known scheme. It is similar to the Hansen scheme, but 
does not employ staggering in time. It has been proposed by Sielecki (1968), 
and is given by 


dvr) -dj 4 Uru Uj- 


J = 0 
h 
n+ n n n 
Ua Ume Gh- G 
r 9A = 0 


It is left to the reader to show that the amplification matrix G is the same 
as for the Hansen scheme. Hence, the Sielecki scheme is stable if (8.38) is 
satisfied. Because d"+! can be evaluated before U"t1, the scheme is effec- 
tively explicit. Because the scheme is not staggered in time, accuracy is only 
O(r), and is therefore less attractive than the Hansen scheme for the simple 
equations under consideration. But in the presence of Coriolis acceleration 
(in two dimensions) and atmospheric pressure effects, the Sielecki scheme has 
certain advantages, cf. Sielecki (1968). 
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An implicit scheme 


Because explicit schemes usually have to obey a stability conditions, which 
may be too restrictive in practice, implicit schemes are often used. The im- 
plicit Euler scheme has O(r) temporal accuracy, which is generally insuffi- 
cient. A straightforward example of a more accurate implicit scheme is: 


dH- ž d . 
n+1 ny\g+1/2 _ 

At gpl Uia, 

"a2 Ua g 

J 3 J_/m+l1 n iti 

AEU IE Batt arji =o. 


(8.39) 


The temporal accuracy is O(r?). 


We now have a system of equations to solve for each time step. If we order 
the unknowns as follows: 


+1 n+i n+l n+l 


then the structure of the matrix A of the system to be solved is tridiagonal. 


For the stability analysis we postulate (8.23), which results in 


and the amplification matrix is given by 
G = G{'G2 . 
We find that G and G2 are given by 


with 


dr . gT 
y 58/2, B= i= - 


As before, A(G) follows from det(G2 — AG,) = 0, resulting in 


ari 


sin 6/2. 


(1 + io| sin 6/2])? 
1.2 = 7 
1+o?sin* 6/2 


so that 
[A1,2| =1. 


Hence, the scheme is unconditionally stable and contains no damping. 


8.2. The one-dimensional case 319 


Leendertse scheme 


This scheme, proposed by Leendertse (1967) for the two-dimensional nonlin- 
ear shallow-water equations, is a two-step scheme. In the present simple case 
it reduces to: 
+1/2 n+1/2 _ ppn+1/2 
d; | Ui sain — Uj-1j2 _ 
++ +d =0, 
7/2 
n+1/2 n+1/2 m+1/2 
Ojrajz — Uir git G o 
T/2 h i 
+1 n+1/2 n+1/2 _ n+1/2 
dit =d giin an _ 4 
7/2 h 
+1 n+1/2 +1/2 
Us — U;41/2 p oe _ d 


=0. 
7/2 h 


An implicit Euler step is followed by an explicit Euler step. The amplification 
matrix is the product of the two amplification matrices (8.25) and (8.26), with 
T replaced by 7/2: 


E 7 E 1 —id= sin 0/2 
G=G2G;", G2= ( _igz sino 1 
E 1 id} sin 0/2 
Gis ( igs sino /2 1 | (840) 


The eigenvalues A(G) follow from det(G1 — AG2) = 0, giving 
(1— à)? + (14 A)?o? sin? 6/2 = 0, 


so that 


l] + iø sin 6/2 
ài = — Al 
L2 1 io sin 6/2 ’ (8.41) 


from which it follows that |A1,2| = 1, so that we have unconditional stability, 
and there is no damping. 


Dissipation and dispersion 


By postulating a harmonic wave W = W exp{i(kz — wt)} for the linearized 
shallow water equations (8.12) without bottom friction we obtain the follow- 
ing dispersion relation: 


w = tke. (8.42) 
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By assuming a harmonic wave for a numerical scheme we also get a dispersion 
relation. For example, for the Hansen scheme we may postulate 


d; = dexp{i(kjh — wnr)}, ale = Uexp{i(kjh —wnr)} . 


Substitution in (8.34) and (8.35) results in 


eer —1 zt (1 — e~h) d o. 
TI (e'Fh _ 1) 1 — e”7 U 
By requiring the determinant to be zero the following dispersion relation is 
obtained: 


w = +2 arcsin(o sin kh/2) . (8.43) 
T 


According to (8.37), for stability we must have ø < 1. This makes w real, so 
that there is no numerical dissipation. In order to approximate a wave with 
wavenumber k we must have h < 1/k. Under this assumption (8.43) can be 
approximated by 


w & tke{1 — al —o*)k*h?} . (8.44) 


Hence, using a superscript A to distinguish numerical from exact quantities, 
the numerical phase and group velocities (see (8.16)) are given by 


cf = te{1 — all —o*)k7h?}, cf = +e{1-— (l —o*)k?h*}, (8.45) 


showing O(h?) accuracy, as expected. 


Unlike the exact gravity waves, the numerical approximation shows disper- 
sion, unless g = 1, in which case the Hansen scheme has no error. The error 
for the group velocity is larger than for the phase velocity, which is a pity, 
because energy travels with the group velocity. 


We briefly present dispersion analysis for the other (stable) schemes. 


The dispersion relation for the Sielecki scheme is the same as for the Hansen 
scheme. 


It is left to the reader to show the implicit Euler scheme has the following 
dispersion relation: 


eT = 1+ Yiosinkh/2 . (8.46) 


Writing w, = Re(w), w; = Im(w), we see there is damping at a rate 


eT = 1/4/1 + 407 sin? kh/2. 
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Because the differential equation contains no damping {according to (8.42)), 
this numerical damping makes implicit Euler unattractive. According to 


(8.46), 
1 . 
wp = t—arctan(o sinkh/2) . 
T 
Approximation for kh & 1 gives 
wr S +ke{1 — =l + 80? )k?h?} . 


1 


ch = +e{1— (14 807)k7h7}, ch = +e{1 — 5 (1 + 807)k7h?} . 


Comparison with (8.44) shows that the numerical dispersion of implicit Euler 
is significantly larger than for the Hansen scheme. 


For the leapfrog scheme the dispersion relation is found to be 
] . . 
w = +- arcsin (2o sin kh/2) . (8.47) 
T 
We have stability only if o < 1. We have propagation (w real) only if 
|20 sin kh/2| < 1, in which case is no numerical dissipation. Approximation 


for |kh| < 1 gives 


w = the{1 — = — 40?)k?h?} , 


cå = +e{1 — zl ~ 407) k*h?} , cå = +ce{1- 5 (1 — 40°) k*h?} , 
which is a little less accurate than the Hansen scheme (cf. (8.45}). 
For the implicit scheme (8.39) the following dispersion relation is obtained: 
w= +2 arctan (ø sin kh/2). (8.48) 


Under all circumstances, w is real, so there is no numerical dissipation. Ex- 
pansion for kh & 1 gives 


1 
w = tke{l— a (1+ Qo?) k7h?}, 
from which it follows that 


how 1 1 
ch = e{1 — aq (it 20°)k?h?}, ch = te{1- gil + 207) k*h?} , 
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which is less accurate than the Hansen scheme. 


The dispersion relation of the Leendertse scheme may be determined in the 
following way. Postulate 


d? — dr e'ikr, UP = Ü” taka 


d”+1 4 d” 
(dan )=063 (a) 


with G1, G2 given by (8.40) with 0 replaced by kh. Assuming d = de~*”"7, 
U” = U e-™"7, we find a solution if 


Then we find 


A 


(GGI — e7™" I) ( k ) =0. 


The eigenvalues of GGT} are given by (8.41), so that 


_ ltiosinkh/2 
~ lziosinkh/2 ~ 


wT 


The modulus of the fraction equals 1, so that w is real, and we have no 
numerical dissipation. The dispersion relation is 


w = +2 arctan(o sin kh/2) , (8.49) 
T 


which is the same as for the Hansen scheme (equation (8.43)). We may con- 
clude that the accuracy of the Leendertse scheme is O(r? + h?). 


Exercise 8.2.1. At what speed do the waves travel caused by a stone thrown 
in a pond of depth 1 m? 


Exercise 8.2.2. Derive the Riemann invariants z1, z2 for equations (8.11), 
(8.12). Give a discretization of the boundary condition z; (t, 0) = given for the 
explicit Euler scheme. Similarly for z2. Why is it mathematically wrong to 
prescribe z2(t, 0)? Perform numerical experiments and see what goes wrong 
with prescribing z(t, 0). 


Exercise 8.2.3. Solve equations (8.11), (8.12) numerically with the schemes 
discussed. Prescribe initial and boundary conditions corresponding to the 
following exact solution for the case without bottom friction: 


d=dsin(t —2/c), U =csin(t — z/c), 


and study the numerical error. What is the numerical propagation velocity 
of gravity waves? 
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Exercise 8.2.4. Alternative stability analysis of the leapfrog scheme 
Analyze the stability of the leapfrog scheme by postulating instead of (8.23) 
and (8.24): 


dtti d” 
Url U” 
d” =G qd”-1 
a” pn-l 


Determine the matrix G and its eigenvalues, and derive (8.32). 
Hint: Consider a partitioned matrix 


Al 
c=(7 SE 


where the blocks of G are square and of equal dimension. Let the correspond- 
ing partition of an eigenvector v be 


vı 
v = ; 
v2 


and let A be the corresponding eigenvalue. Since Avy + v2 = Avy, vi = Ava, 
vı and v2 are eigenvectors of A, so that À is related to the eigenvalues p(A) 
by (8.29). 


Exercise 8.2.5. Consider the linearized shallow-water equations (8.11), (8.12) 
including bottom friction. Carry out stability analysis for the leapfrog scheme, 
and show that bottom friction has a damping effect. 


Exercise 8.2.6. Make a plot of the dispersion relation (8.43), of the ap- 
proximate dispersion relation (8.44) and compare with the exact dispersion 
relation (8.42) 


Exercise 8.2.7. Derive the dispersion relations (8.46) and (8.48). 


8.3 The two-dimensional case 


Governing equations 


The two-dimensional shallow-water equations (1.121) and (1.124) can be writ- 
ten as 


ôd | adUy | dU, 
dt On, Ox, 
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where U and Uz are the depth-averaged velocity components, d the wa- 
ter depth, g the gravitational acceleration, f the Coriolis parameter and C 
the Chézy friction coefficient. The importance of the Coriolis acceleration is 
measured by the Rossby number, defined by 


where V is a measure of the magnitude of the velocity, and L a typical length 
scale. 


Classification 


As in the one-dimensional case, we determine the mathematical type of (8.50) 
along the lines set out in Chap. 2. The system (8.50) may be written as 


aw, p OW | OW _ (8.51) 


with 


d U, d 0 U, 0 d 
W= (U7, HM=[¢ UU 0], Rss| 0 UW 0 |, 
U 0 0 U g 0 Uy 


0 
fUz — gU |U |/C?d — gdH/dz, . 
—fU, — gU2|U|/C*d — gð H /ðz3 


Il 


Q 


Following Sect. 2.2, the type of (8.51) depends on the properties of the eigen- 
problem given by 


det (nol + ni Fi + neF2) = 0, (8.52) 
which gives 
fig{hg — c?(n? + n2)} =0, (8.53) 
where 
ño = no + niU + ne, eè =gd. (8.54) 


Three linearly independent solutions of (8.53) are given by 


fio =0, fig = 4ey/n? +n. (8.55) 


Equation (8.52) arose in Chap. 2 from postulating a solution of the form 
W = We” (te), The equation for W is 
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a 


(nol +nifi t+ noF,)W =0. (8.56) 


Corresponding to the three cases of (8.55) we find the following three solu- 
tions, choosing nı = cos 3, nz = sin 8 with 8 arbitrary: 


0 c —c 
W, = sin f , Ŵ= | -geosf |, W3 = | —gcos8 |. (8.57) 
—g cos 3 —g sin 8 —g sin 8 


Since we have found a full set of eigenvectors of (8.56), the system (8.51) is 
hyperbolic according to the classification scheme discussed in Chap. 2. 


Types of waves 


In Sect. 8.2 we have, for the one-dimensional case, formulated a rule to deter- 
mine appropriate boundary conditions based on the number of characteristics 
that enter the domain. In the two-dimensional case, appropriate boundary 
conditions can be selected by working with wave fronts (characteristic sur- 
faces). We take a closer look at the solution W = Wie’*), With W given 
by (8.57) this is a solution of (8.51) with Q = 0 if 


Ow Ow Ow 


— = — = — = 8. 
a"? Ja V da, (8.58) 


with no, n1, ng chosen according to (8.55), corresponding to the choice made 
in (8.57). Surfaces where W = constant, i.e. w(t,#) = constant, are called 
wave fronts in Chap. 2. 


According to (8.55), we have three kinds of waves. For no = 0 it follows from 
(8.54) and (8.58) that 


Ow Ow Ow 
a + be, + oe, = 0, 


so that this wave moves with velocity (U1, U2). From (8.57) we see that for 
this type of wave, d = 0. The vorticity satisfies 


w = OU, /Ox, — OU, [0x2 = ie (— cos aoe — sin ao) = =i" . 
1 2 


Hence, the vorticity is constant along wave fronts. Therefore this type of wave 
is called a vorticity wave. It reprensents transport of vorticity by the flow. 
The second kind of wave is obtained for ño = cy/n? + nż, or 

no = ¢ — Ui cos 8 — Ua sin 8, ni = cos, no=sing. 


From (8.58) it follows that 
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~ + (Uj, - cos 8) S + (Uz — esin A) =0, 

so that this wave moves with a velocity which is the sum of the flow velocity 
and a gravitational phase velocity vector in the direction (— cos , — sin 2). 
The third kind of wave is obtained for ño = —eyn? + n2. It is left to the 
reader to show that this wave moves with velocity (U1 +ccos 8, Ua +csin B). 
Since @ is arbitrary, there is no need to distinguish between the second and 
third kind of waves. These are called gravity waves. Summarizing, we have 
found two kinds of waves: vorticity waves, moving with velocity (U1, U2), and 
gravity waves, moving with velocity (U; + ccos 8, Uz + csin 8), 8 arbitrary. 


Waves in the linearized case 


We now keep part of the right-hand side Q in (8.51), but linearize. Under 
the same assumptions as in Sect. 8.2, the two-dimensional linearized shallow- 
water equations are given by 

Ow Ow Ow 


—. — + Fa — = . 
ðt + iJe, T 25, TOW 0, (8.59) 


where 

k d Ur d 0 U2 0 d 

W = ui ; Fj = g Ui 0 f F, = 0 U> 0 ; 
u 0 0 U 0 U 
2 1 g 2 (8.60) 
00 0 

G=|0r -f 
Of r 


Assume a harmonic wave of the form 
W = We'l#=-“t) W = constant , 
where k = (kı, k2), æ = (£1, £2). This is a solution of (8.59) if 
(—iwl + iki F, + ike, + G)W =0. 


There are nontrivial solutions if the determinant is zero. Neglecting bottom 
friction (r = 0), this results in: 


i3(O? — k?e? ~ f?)=0, k= hk? +k?, © =w- kÜ — k0. 


The root & = 0 corresponds to the vorticity waves. The other roots are given 
by 


w =A V/ctk? + f?. 
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The corresponding waves are called Poincaré waves. They are dispersive, with 
phase and group velocity components 


c 
C 


w/ka = Ug belt f?/k?e?, 
Ow /Okg = —Ug + ey/1 + f2/k2e? . 


2 QE Q 


The pure gravity waves are recovered if f/kc < 1, or 
Bu = c*k?/f? > 1, 
where Bu is called the Burger number. 


At moderate latitudes, according to (1.122), the Coriolis parameter satisfies 
f & 2m per 24 hrs. We have f/kc = 1 if the wavelength L = 27/k satisfies 
L = 2ne/f, which is, with the assumed value of f, the distance traveled by 
a gravity wave in 24 hours. In shallow water, for example d = 20 m, we 
have c = 14 m/s, so that L = 1200 km; in deep water, for example d= 2 
km, we have c = 140 m/s and L = 12,000 km. We see that for large scale 
geophysical fluid dynamics the effect of Coriolis acceleration is not negligible. 
As mentioned before, its importance is measured by the Rossby number Ro. 
If Ro < 1, Coriolis acceleration dominates. The effect of bottom friction on 
wave motion will not be considered, because it has significant influence mainly 
in river flows, which can be modeled by the one-dimensional shallow-water 
equations; this case has already been examined in Sect. 8.2. 


Boundary conditions 


As initial condition, U must be given. The generalization to more dimensions 
of the rule for the boundary conditions found in Sect. 8.2 is: 


The number of boundary conditions in a point on the boundary of 
the domain must be equal to the number of different types of waves 
that can enter the boundary at that point. 


As an illustration, consider a left boundary zı = 0. Two (of the many) pos- 
sible gravity waves correspond to 8 = 0,2. These propagate with normal 
velocity component U1 +c. The vorticity enters or leaves with normal ve- 
locity component U1. This gives us the cases listed in Table 8.1, where n 
is the number of possible incoming waves and also the number of boundary 
conditions to be prescribed. The next question is, what to prescribe? For 
the gravity wave with 8 = 0, only d and U; are nonzero, according to (8.57). 
This reduces the governing equations to the one-dimensional case of Sect. 8.2, 
which tells us the following. In the subcritical cases, a quantity independent 
of U; — 2./gd must be prescribed, such as U,,d or Uy + 2/gd. In the super- 
critical inflow case, both U, and d must be given. For the vorticity wave, only 
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U; < —c upercritical outflow 
—e < Uı <0 | Subcritical outflow 
0<U <c | Subcritical inflow 
Supercritical inflow 







Table 8.1. Number of boundary conditions (n) to be prescribed. 


U; and U2 are nonzero, according to (8.57). Since U, is already determined 
by the incoming and outgoing gravity waves, Uz must be prescribed if the 
vorticity wave is incoming (U; > 0). 


Often, a part of the domain boundary is artificial, and is introduced merely 
to restrict the domain of computation, for example, to separate the North 
Sea from the Atlantic. Then it is important that this boundary, as far as pos- 
sible, does not generate artificial numerical wave reflections. It then becomes 
important what combinations of d, U and U2 are prescribed. Various weakly 
reflecting boundary conditions have been developed. We will not discuss these. 
Information may be found in Engquist and Majda (1979), Verboom and Slob 
(1984), Vreugdenhil (1994). 


Choice of grids 


Figure 8.3 shows the staggered grid proposed by Platzman (1959) and used by 
Hansen (1956) and Leendertse (1967). The velocity component U; is located 


Fig. 8.3. Staggered grid 


at grid points (j+1/2,k), Uz at (j,k+1/2) and dat (j, k). Of course, there are 
different ways to assign unknowns to grid points, including colocated schemes. 
Five different configurations are studied in Arakawa and Lamb (1977), and 
are generally known as Arakawa grids A, B, C, D, E. The grid of Fig. 8.3 is 
the Arakawa C-grid. In a comparative study, Randall (1994) finds that the 
C-grid gives the best approximation of the dispersion relation. Here only the 
C-grid will be used. Several schemes will be presented with their stability 
analysis for the linearized shallow-water equations (8.59), (8.60) with U; = 
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U, = 0, G = 0. Numerical dispersion will not be analyzed, because the 
one-dimensional results of Sect. 8.2 give sufficient indication of the relative 
merits of the various schemes in this respect. Furthermore, a scheme that is 
frequently used in practice for the nonlinear equations (8.50) will be discussed. 
The linearized equations will be denoted for convenience by 


d 0d 0 00d 
W=(U], M=alg00)], h=l[000]. 
V 000 g 00 


The (uniform) mesh size in the z-direction is denoted by ha. 


Hansen scheme 


The one-dimensional version of this scheme has been proposed in Hansen 
(1956). The scheme is staggered not only in space but also in time, and is 
given by 
+1 rd n+1/2 n+1/2 rd n+1/2 n+1/2 _ 
dig — ik + h, Vitija ~ U; ijk) + hy! jk+i/2 isk 1/2) = 0, 
n43/2 n+1/2 TI, n+l 41) _ 
U si/a.k — U; 41/2, n, +k 7 dik ) =0, 
n43/2 n+1/2 , T9; m41 +1) _ 
Vi et1/2 ~ Vis ett /2 + fy k+ T di ) =0. 


Stability analysis is standard, and similar to the analysis of the one-dimensional 
case in the preceding section. We postulate 


hn an 
Uigi/ak | = | Ott? | efi the) 
n+1/2 prtl/2 
jk+1/2 
and find 
drt d” 
[ynt3/2 =G Ọr+1/2 , G= G3 'Gı , 
yrt3/2 yrti/2 
with 


1 -—f; -bz 1 0 0 
0 0) 1 ma 01 
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where 
gT , ip dr —i8 
=>— (em — 1), Bm = 7—(l—e m) m=1,2. 


Qm = 
hm m 


From det(G ; — AG2) = 0 it follows that 
(1—A){(1—A)? — 2A} = 0, p= 0? (cosO, — 1) +.02(cos@z~1), (8.61) 
where om = cT/hm, m= 1,2. For 
—2<p<0 (8.62) 


we find that the roots of (8.61) satisfy |A1,2,3| = 1, otherwise there are roots 

with modulus > 1. Hence (8.62) is sufficient and necessary for stability. The 

condition (8.62) is to hold for all 801,82, so that the stability condition is 
hihz 


ey/h? +h? 


Oyto2<1 or r< (8.63) 


An implicit scheme 


The two-dimensional version of the implicit scheme discussed in Sect. 8.2 is 


n n rd n\|i+1/2,k rd n n\jj,k+1/2 
dr; de + 5 ene tia t TA (Vet + Vet = 0, 
“ition ane + 37 th ao (nt) +d") he" = = 0, (8.64) 


n n n ue 


Stability analysis proceeds as follows. We postulate 


a my 
Uruz | = | UP eh fitkta) , (8.65) 
Vik+1/2 ve 
qvti d” 
U"t1 |} =G| on |, (8.66) 
yrtl on 
where G = G7'Gi, with 


l -a, -a 1 ai à? 
G=|-A 1 0 |], @=({4 101), (867 
— Bo 0 1 Bo 0 1 


and find 
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where am = (Td/2hm)(1 — exp(—i0m)), Bm = (7g/2hm)(exp(iOn) — 1), 
m = 1,2. Although the roots of det(G, — AG2) are easily determined, as 
before, we wish to demonstrate a different technique, that wil be useful later. 


The matrices G4 and Gz can be symmetrized by a similarity transformation 
with the diagonal matrix D defined by 


1 0 0 
D= [o /g/deis/? 0 . (8.68) 
0 0 V/9/de'/? 


We find 


. l =y =y 
DIG, D =Gi= | -y 1 0 ; 
-y 0 1 
(8.69) 


. 1 y %2 
DiG, D =G2= |y% 1 0 ; 
Y2 0 1 


where ym = 10m sin 6n/2, Om = TC/hm, m = 1,2. Because Ym is imaginary, 
Gı and G2 are adjoint: 


G,=G*, (8.70) 


where A = BH means aij = bi, Re(b;;) = Re(d;i), Im(b;;) = —Im(};;)- 
Furthermore, G2 is normal: 


G.GH = GEG. (8.71) 
With (8.70) and (8.71) we find that G = Gz1G; is unitary: 
GGE = I, 


so that |A(G)| = 1. Since A(G) = \(G), we have proved unconditional stabil- 
ity. The scheme has no damping. 


Leendertse scheme 


In order to reduce computing work, the implicit scheme may be approximated 
by stages that are implicit in one direction only. This results in the following 
ADI (alternating direction implicit) scheme: 
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nti/2 om | TE nar jajti/ak , TE nij ktij _ 
dig — dip + g Uit 1/2,k t 5p Thy Dk- 1/2 — 0, 


n+l 2 n 1,k — 
Ujak — Uitzet anit = 0, 


yore _ 


n jk+1l 
jk+1/2 Tha t op -d Gk =0, 


(8.72) 
1 mtij2 Td 2jj+L/2k z n+lijk+1/2 _ 
dig — dik gim ize + 2h, PERRE 1/2 — 0 


n+l n+1/2 n +1,k 
Urio, k -Uart k + oh; =f 4 12 0 Lo, 


+1 n+1/2 +1)3,4+1 _ 
Vik+1/2 7 Vik+1/2 + at lik = 0. 


The time step is divided in two parts. In each part, only tridiagonal systems 
have to be solved, along horizontal or vertical grid lines. This is considerably 
cheaper than solving (8.64). The method has been designed such that O(r?) 
accuracy is maintained. The scheme has been proposed in Leendertse (1967). 


For the stability analysis of (8.72) we follow Stelling (1983). The principles 
are the same as before. By postulating (8.65) and (8.66) we find 


G = G7 'G3G3 Gi, 


with G7'G the amplification matrix of the first half step and G7 +G; that 
of the second. We find 


l 0) —a3 1 ay 0) 
Gi = 0 1 0 : G = fy 1 0 ; 
o 0 1 [s 0 r) 
1 —-a1 0 1 0 Q? 
a= ( -a 1 o). Ga=z=| 010], 
0 0 1 Bo 0 1 


with am and fm defined after equation (8.67). We symmetrize with the ma- 
trix D given by (8.68) and find 
G = DGD = G7z!GsG7'Gi, 

with 

1 0 -y . l 7 0 - g 

Gi = 0 1 0 s G@G=| yn 10], G= Gf, Gz=G#, 

001 
(8.73) 


where Ym is defined after equation (8.69). We have G = Gy 1 GEG’, 
hence 
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GĦ GGI" = GFGy'GiGy". (8.74) 
G, is normal, therefore 


(GGI GGI)” 


GGT” Gy'G? = Gi(GiG#)-1GF 
G (GE Gi) IGE =] 


il 


showing that GiGi” is unitary. Similarly, GH Gs l is unitary. Then it follows 
from (8.74) that GĦGGI*” is unitary, so that |\(G)| = 1, proving uncondi- 
tional stability. The scheme has no damping. 


Disadvantage of ADI schemes 


We see that without compromising the local truncation error or stability, we 
have replaced the implicit scheme (8.64) by the much cheaper ADI scheme 
(8.72). But ADI schemes have a disadvantage that the implicit scheme does 
not have. It turns out that, depending on the geometry of the problem, to 
achieve sufficient accuracy the time step + must be much smaller for the ADI 
scheme than for the implicit scheme. This is a pity, because the main moti- 
vation for unconditionally stable schemes is to allow large r. The efficiency 
advantage of ADI is lost if rT is severely constrained. 


The loss of accuracy of ADI in domains of certain geometric shape may be 
explained as follows. Physical disturbances travel with the celerity c. For ac- 
curacy it is necessary that changes in the numerical solution propagate with 
a velocity that is approximately equal to c. That is, they should travel over 
a distance of approximately cr in one time step, that is a distance omhm 
in the z,,-direction. Now assume that the domain is the zig-zag channel of 
Fig. 8.4. A change in the numerical solution in P influences the whole hor- 


IW 


SS 


izontal row of cells in which P is located in the first half time step. In the 
second half step, all vertical rows of cells that intersect the previous horizon- 
tal row are affected. The numerical solution in other cells is not influenced. 


Fig. 8.4. Zig-zag channel 
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Hence, a change in the solution in P affects only the shaded area in Fig. 8.4, 
in one time step. As a consequence, if c} and c} are so large that a physical 
disturbance in P travels much beyond the shaded are over a time-span T, 
the ADI scheme is inaccurate. This is illustrated by applications worked out 
in Stelling, Wiersma, and Willemse (1986), in which the numerical propaga- 
tion speed of tidal waves depends entirely on the geometry of the spatial and 
tidal flats, in combination with the spatial and temporal step sizes. This is the 
reason why in Wilders, van Stijn, Stelling, and Fokkema (1988) an implicit 
(non-ADI) scheme with an efficient iterative solver is developed. However, if 
the shape of the domain does not inhibit signal propagation, or if the time 
step has to be limited for other (accuracy) reasons anyway, then ADI schemes 
are attractive. 


Stelling scheme 


To illustrate how the non-simplified shallow-water equations may be solved 
numerically, we present details of a discretization scheme that has found 
widespread use in practice. It is a further development of the Leendertse 
scheme (Leendertse (1967}), and is described in Stelling (1983). The scheme 
is of ADI type. Coriolis acceleration, bottom friction, nonlinear advection 
terms and an interior viscous term are included. The spatiai grid is staggered 
and of Arakawa C type, as before. The location of the unknowns is given in 
Fig. 8.5. The governing equations are (1.121) and (1.124) with inclusion of 


H U H 
U § W 
H U2 H 


Fig. 8.5. Location of unknowns in staggered grid 


an interior viscous term: 
0g 4 ðdU adv _ OH 
at Or zə E Gt’ 








aU, pU „əU o£ gU|U| 277 

Ji +U am ony t an, T + C2d —vV*U = 0, (8.75) 
ôV av av ak gV |U] a 

a oe tV am Ie IU + Tea Y V 


where V? = 0? /ðxz? +8? /Oxz3, d = €—H and |U| = VU? + V?. For simplicity 
we assume ĝH/ôðt = 0. 


A time step is divided in two stages, in ADI fashion: 


8.3. The two-dimensional case 


Stage 1: 
2 n n n 1 2,k 1 n n ik 1/2 
=(é 41/2 _ £ ); a+ 7 la +1/277 tapt + PAG V ee 
2 eon n PyPtl/2 pynjj+3/2,k 
=U +? y )j+1/ak + a = UF Nia. sU eae + 


l n+1/2 j+1/2,k+2 njj+1l/2,k+1 
12h, j+1/2,4 "pizz + AU Gaay2k—1) + 


l on j+1,k n+1/2 
a take — fV 41/2, k t 


gU" HAU) + (VPP PP (Cd) 4172.6 — VVU 12, = 0. 


2 n Th Th 
(yn tie — V” )jk+1/2 + Ajegis2(U™, Vt?) + 


yrrl/2 I R+3/2 at weet f 


1, n 
Thy Vikt1/2 j,k~1/2 Ujk+1/2 


n n n n n 2 — 
talv” HU”) + (V”) T: Cd) mera — VVRV EEN, = 0. 
Stage 2 


n n n n 2,k 
(ent _¢ #1/2) y a +1/277 +2) rt 


3a) bo 


paynet 0 


al S|- 


yrti _ yntt/2) prti/2 UHLÍ +3/2;k 


1 
j+1/2,k + hy 5p i 41/2,4 j—1/2,k 


(Vee? ptt) 4. gentle pyri + 


+Bj41/2,k 
aun uray (VP+1/2)2} 1/2 /(C2gntH/2)) 12k — 

vV?U? ak = = 0, 

“(ve — VPH?) i apt 


1 +1 n+1/2;3+2,k+1/2 1/2,3+1,k+1/2 
12h, Us eei/a(V / j —2,k+1/2 +4y"* / aa’ k+1/2) + 


1 ik 
+1 +1/2,j,k +3/2 +1jj,k+1 prt 
Thy Akty V" / kaya) + hə A be + FU jik+1/2 


tavern? + (VME 222 (CAGED app — 


"era = 


335 


= 0, 


Here V? stands for the standard discrete Laplacian. The operator A is defined 


by 


336 8. The shallow-water equations 


Ak (Y, Z) = Y;k(3 Zk — 4Z;-1,k + Z;~2,k)/2h1 if Yjk >00, 
(8.76) 
Ajk(Y, Z) = Yjr(—3Zjk +42j41, ~ Zj42,6)/2hi if Yj <0. 


The operator B is defined similarly, but with differences in the x-direction. 
Quantities that have to be evaluated at points where they are not defined are 
obtained by straightforward averaging, for example, 


1 
d341/2,4 = 5 (Sik + Ej4ijk — Ajaijoegis2 — Hi41/2,k 1/2) - (8.77) 


The implicitness is carefully distributed over the two stages, such that mainly 
tridiagonal systems have to be solved. In addition, a few iterations are carried 
out for the nonlinear terms, the operators A and B and the viscous terms. 
The convection terms have been discretized such that second order accuracy 
is obtained over a complete time step, with a certain amount of upwind bias 
to avoid spurious oscillations, while keeping numerical dissipation and dis- 
persion small. 


Implementation of boundary conditions is straightforward, except for the con- 
vection terms, which need special care. For details, see Stelling (1983), where 
also procedures to follow when d becomes zero (tidal flats) are described. 


A non-ADI version, to be used when ADI is inaccurate due to the geomet- 
ric effects discussed before, is presented in Wilders, van Stijn, Stelling, and 
Fokkema (1988), including discussion of efficient iterative solution methods. 


Stability analysis of Stelling scheme 


To give an example of Fourier stability analysis for a more difficult problem 
than before, we discuss stability of the Stelling scheme. The ‘frozen coefficient’ 
version of (8.75) is, assuming H = 0, and simplifying the bottom friction term 
as in Sect. 8.2, 


dd {AU ƏV\ -ôd -ôd 

a d | —— —_— —— — = 

at (= N tU Sz t "Oe. 

ðU ._dU -U  ðd 

— 4 7a a a“ _ yw = 
a tUa eo fV +rU —vVU =0, 


OV -ƏV -8V ôd 2 
The frozen coefficients are denoted by an overbar. Let U > 0, V > 0. By 
discretizing with the Stelling scheme and postulating (8.65) and (8.66) we 
find: 


GW” +12 = GIW”, GqW"tt=G,wr?, W= (å Ù, ¥)T, 
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with 
1-2 0 _ 702 
G= 0 L= y -h + V? 0 ; 
— (39 -f 1 
l+y &ı 0 
Go = fy l+rr/2 _ —f ` ’ 
0 0 1+A+yo4+rr/2—V? 
(8.78) 
l- y —ay 0 
G3 = —ĝı 1 f ; , 
0 0 l- y — å + V? 
1+ 2 i 0 . a2 
G4 = 0 1+B+y+rr/2- V? 0 A 
Bo f l+rr/2 
where 
rd ~i8 TY ; i 
= —(] — 10m = —>— (em _ = 1.2; 
am Ih, (l-e ), Bm oh, (e 1), m 2; 
iv 8 —— sin, ô irU —— (sin 26, + 4sin 81), 
yı = oh mei, Y= 2, 01 = oh 1 1 
3 = ih (sin 262 + 4sin 62), f= La Her peT y ef (91-82) 
— —iĝi — 218, D TV _ —ibs — 2109 
A= ii, TU g- 4e +e ) B= Th; — (3 — 4e +e ) 


V? = vr{(cos@, — 1)/hf + (cos 6 — 1)/h?}. 


Similarity transformation with the diagonal matrix D given by (8.68) results 


in 
; l~y 0 -u . l+ Hi 0 
Gi = 0 g 0 , G= fy l+rr/2 -f ], 
-H -f 1 0 0 92 
_ 1-71 -p1 0 - l+72 0 H2 
G3= | -m 1 f |, G= 0 g4 0 , 
0) 0 43 H2 f l1+rr/2 
where Gm = D'G,D, m = 1,....4;5 g1 = L= y — 62 + V3, 


g2 = 1+A+y2+rr/2— V2, g3 = 1- vy 81 +02, g4 = 1+B +7 +rr/2— %2, 
[im = 10m Sin m /2, m = 1,2. The amplification matrix G satisfies 

G = Gy'GsGy'Gi = DGZ'G3Gz'G,D7) 
(GaD~')-1(G3Gz'G1Gz1)(G4D7!). 


I 


338 8. The shallow-water equations 


Hence 
WG" || < |[(G4D~")- || GDI GG "||" |GaGg" |”. (8.79) 


For simplicity we neglect bottom friction, ie. F = 0; in practice, bottom 
friction is found to have a stabilizing effect. The eigenvalues of G3G7! satisfy 


det(G3 _ G2) = 0. 
Expansion gives 


(g3 — Agz)det(C — \C*) = 0, 


c= (157 tr). 
-m 1 


The matrix C is normal, so that CC” is unitary. Hence, the roots of 


where 


det(C — AC?) = 0 


have modulus 1. It remains to inspect |A| = |g3/g2|. Noting that V? < 0, it 
is easy to see that IRe(g2)| > |Re(g3)|. It is also easy to see that |Im(g2)| > 
|Im(g3)|, so that |A(G3GZ 1] < 1, and ||G3Gz "||" is bounded. In a similar 
way it can be shown that \GiGz ii is bounded. Furthermore, 


; 14+ 2 0 jaf d/ge~*92/? 
GD! = 0 g4r./d/ge—*9:/? 0 


uz fJVd/ge il? J/d/ge7*92/? 


Since the elements are bounded, ||G4D~"|| is bounded. Finally, 


= 4 i 1 Hof /ga — H2 
G3 =(1t+72-43)7' 0 (1 4+72-13)/94 0 
~H2 —(1+72)f/94 1+ 72 


Since g4 is bounded away from zero, and 


. iTV 
l+y2—p3 = 1 +o} sin? 02/2 + p7 sind, 


is also bounded away from zero, the elements of (G4D71)7} are bounded, so 
that ||(G4D~+)—"|| is bounded. Hence, we have shown unconditional stability. 
The scheme remains stable when the interior viscous term is deleted. 


Exercise 8.3.1. Derive equation (8.78). 


9. Scalar conservation laws 


9.1 Introduction 


In Chap. 8 we have introduced systems of hyperbolic differential equations 
by discussing the particular case of the shallow-water equations. Especially 
in the linearized case, these form a very simple example of an hyperbolic 
system, permitting an easy introduction of basic concepts as characteristics, 
wave fronts and proper boundary conditions. In the nonlinear case, new phe- 
nomena occur, notably shocks and other types of discontinuity. These occur 
when the local flow velocity is larger than the phase velocity of waves. The 
discretizations discussed in Chap. 8 break down in this case. In order to pre- 
pare for the treatment of compressible flows, we give an introduction to scalar 
conservation laws in the present chapter. 


We start by introducing the concept of monotonicity preservation and 
Godunov’s order barrier theorem. Since all later work circles around these 
matters, we discuss this material in detail. This enables us later to clearly see 
the relation between various properties, notably the concepts of total varia- 
tion diminishing (TVD) and positive schemes. It will be seen that the TVD 
property is harder to obtain than monotonicity preservation, and that in fact 
TVD is not necessary, but monotonicity preservation is. 


More extensive introductions to the vast subject of scalar conservation laws 
and hyperbolic systems of conservation laws may be found in Godlewski and 
Raviart (1996), Kröner (1997), Laney (1998), LeVeque (1992), Majda (1984), 
Smoller (1983) and Toro (1997). 


9.2 Godunov’s order barrier theorem 


The simplest hyperbolic equation is the simplest partial differential equation 
that one can think of: 
Op Op 
dt “Oa ~ 
with c = constant and ọ a scalar function. Without loss of generality we 
assume c > 0. The general solution is 


0, t>0, ceR, (9.1) 
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p(t, 2) = plz — et), (9.2) 


showing that » is convected with the flow with velocity c. The simple problem 
(9.1) deserves special attention, because in a slightly more general form it is an 
essential ingredient in the equations governing fluid dynamics. Oddly enough, 
despite its simplicity, numerical solution of (9.1) is notoriously difficult, and a 
wealth of literature has appeared on the subject. The big difficulty is how to 
circumvent Godunov’s order barrier theorem, which we shall present shortly. 


Monotonicity preservation 


A function f(s) is called monotone if f(s) is either non-decreasing or non- 
increasing as s increases. It follows from the exact solution (9.2) that if p(0, x) 
is monotone, than so is y(t, £), t > 0. Therefore (9.1) is called monotonicity 
preserving. 


As we saw in our study of the convection-diffusion equation in Chapters 4 and 
5, numerical solutions may exhibit spurious wiggles, which are to be avoided 
in most applications; this is also the case for the convection equation (9.1). 
Occurrence of such wiggles is unlikely if the numerical scheme shares the 
property of monotonicity preservation with the differential equation. Denote 
the numerical approximation to p(t”, 2;) by 7, as usual, with {x;} the 
nodes of some computational grid with z; > z;~1, and {t°}, n = 0,1,... 
an increasing sequence of instants. For the present discussion it suffices to 
consider an infinite domain, so j = 0, +1, ..., and no boundary conditions are 
involved. The sequence {p7} = {..-, 971,97) 9}41.---} is called monotone 
if pf is either non-decreasing or non-increasing as j increases. The desired 
property can be formulated as follows. 


Definition 9.2.1. Monotonicity preserving scheme 

A numerical scheme is called monotonicity preserving if for every non- 
decreasing (non-increasing) initial condition {9°} the numerical solution at 
all later instants {pF}, n = 1,2,..., is non-decreasing (non-increasing). 


This is the nonstationary version of the notion ‘scheme of positive type’, 
introduced in Sect. 4.4 for time-independent schemes. 


Godunov’s order barrier theorem 
We follow the original work of Godunov (1959). Let the computational grid 


be uniform: z; = jh, and let the time step be constant: t? = nr. Then all 
(explicit and implicit) one-step numerical schemes can be written as 


Do Bm Pim = LL amam - (9.3) 
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It is assumed that {8m } is such that {prt} is uniquely determined. Since 
(9.3) is a linear mapping between {p?} and {per*}, we can write 


opt = Y Imam (9.4) 


There is no need to express ym in terms of a,, and Bm. 


Theorem 9.2.1. Monotonicity preservation 
The scheme (9.4) is monotonicity preserving if and only if 


Ym 20, Vm. (9.5) 


Proof. (Godunov (1959)) 
If suffices to consider the case n = 0, and the case of non-decreasing {y9}. 


Sufficiency. Assume {y}} to be non-decreasing. Then 
0 . 
pj- ela Y ml 4m — Pjtm—1) 29, VJ, 
m 


where we have used (9.5). 
Necessity. Assume yp < 0 for some p. Choose 
g =0, GF <0; p$ =1, j>0. 
Then 
Pi-p 7P p= P<), 
so that {%1} is not non-decreasing, and we arrive at a contradiction. m 


Godunov (1959) defines second order accurate schemes to be those that re- 
produce the exact solution if (0, x) is a polynomial of second degree, and 
goes on to prove: 


Theorem 9.2.2. Godunov’s order barrier theorem 
Linear one-step second-order accurate numerical schemes for the convection 
equation (9.1) cannot be monotonicity preserving, unless |\c|r/hE N. 


Proof. (Godunov (1959)) 
The general form of one-step schemes is (9.3), which can be rewritten in the 
form (9.4). Choose 


e(.2)=(F-5)?-F, w= (i-5)?- 


The exact solution is 
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z—ct 1 


p(t, s) = (~ — 5) -— (9.6) 


4 ` 
Assuming the scheme to be second order accurate, it reproduces this solution 
exactly: 
1 1 1 1 
1_¢; 2 oL 2 
gj =lj -e-a AU- O. 


where o = cr/h is the signed CFL number. Substitution in (9.4) gives: 
| ly 1 | l, 1 
og) -352 mitm- 3-7) (9.7) 


Now suppose that the scheme is monotonicity preserving. Then Ym > 0, 
according to Theorem 9.2.1. Noting that (j7 +m — $y? — ; > 0, it follows from 
(9.7) that 


, l> 1 . 
—o0-—-)*-->0, Vi. 9.8 
G-0-5 772% W (9.8) 
Assume o > 0, o gN and choose j such that j >o > j —1. Then 
(j-o- 5-5 = -e)i-o -) <0, 
which contradicts (9.8). o 


An alternative proof of Godunov’s order barrier theorem is indicated in Ex- 
ercise 9.2.2. 


Of course, we are not interested in the exceptional case where the CFL num- 
ber o = |c|t/h € N, since this cannot be realized with variable coefficients. 


The local discretization error of scheme (9.4) is defined in the usual way as 
PM ae Smem, = planih, (89) 
m 
where €? is the global discretization error. It is left to the reader to show that 


TPT = p(l- Som) + (Te: — hpr Y mym) 


1 
+ 5 (Tpit — h? Dee 5 M? ym) + O(r? + hë) 


m 


9.10 
=p(1— ` ym) — hps(o +Y mym) + 09-10) 


1 
5h Peelo? — >, mm) + O(r? + h?) ; 
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where ¢ and its partial derivatives pr, Yy etc. are evaluated at nt, jh. It is 
left to the reader to show that if the scheme reproduces (9.6) exactly, then 


r? = O(r? +h). (9.11) 


The use of multistep schemes does not get us past Godunov’s order barrier. 
This we will now show. Linear multistep schemes for (9.1), whether implicit 
or explicit, can be written in the following form (cf. (9.4)): 


5 
PF = Dodo ams PF ym - (9.12) 


s=l m 


This is an S-step scheme. It needs specification of initial conditions at S time 
levels: y°, ...,9°—!. The scheme is called monotonicity preserving if for every 
set of non-decreasing (non-increasing) set of initial conditions {%9}, {ge -1} 
the numerical solution at all later instants {7}, n = S, S + 1,..., is non- 
decreasing (non-increasing). 


Theorem 9.2.3. Monotonicity preservation for multistep schemes 
The multistep scheme (9.12) is monotonicity preserving if and only if 


Yms > 0, s=1,..,5, Ym. 


Proof. Sufficiency follows as in the proof of Theorem 9.2.1. For necessity, 
take the non-decreasing case, and assume 7p, < 0 for some p and r. Choose 
yg; = 0, s=1,...,5, Vj, except for s = S — r: 


Then 


which is a contradiction. Oo 


Theorem 9.2.4. Order barrier theorem for multistep schemes 
Linear second-order accurate numerical schemes for the convection equation 
(9.1) cannot be monotonicity preserving, unless |e|r/h € N. 


Proof. Second order accuracy requires that (9.11) holds, with the local trun- 
cation error defined as (cf. (9.9)): 


S 


moon _ n—s$ 
Tj = £j DODE i 
m 


s=l 


It is left to the reader to show that this results in the following conditions: 
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Nn 


5 5 
YY m: =l, SOY (mtos) ims =0, YOY (m + os) yms =0. 


s=z1 m s=1i m szl m 
(9.13) 
It follows that 


5 Ss 
2 dilm+o(s—N}ame=—o, Did {mt ol- 1) ame = 0? 
=" _ (9.14) 


Assuming the scheme to be monotonicity preserving, we have yms > 0 ac- 
cording to Theorem 9.2.3, and we can write ym; = p2,,. From (9.13) and 
(9.14) it follows that 


5 S5 5 
{SoS mm P = {SoS uw HD So} (9.15) 


szi m szl m s=1 m 


with nms = {m+o(s — 1)}ums. The Cauchy-Schwarz inequality says that 
(a.-b)* < |ia’ llb]? , (9.16) 


with equality only if a = 0 or b = 0 or a and b parallel. Comparing this 
with (9.15) we see that the case of a = 0 or b = 0 cannot occur, because 
some fms > 0 and m+a(s —1) Æ 0 in general if |o| ¢ N. The case of a 
and b parallel applies only if yms > 0 or if m+o(s—1) = constant. Starting 
with s = 1, we see that the constant must be an integer. Because |c| ¢ N, 
no practical scheme is obtained. This means that inequality applies in (9.16), 
and we have a contradiction with (9.15). o 


The way to circumvent Godunov’s order barrier is to use nonlinear schemes, 
as in Sect. 4.8. But first we will continue our discussion of linear schemes. 


Properties equivalent to monotonicity preservation 


The total variation TV(y(t, -)) of the solution of (9.1) is defined as 


oO 
. 1 
TV) =limsup= f jolt,2)- p2- e)lde. (9.17) 
-00 
If the total variation of y does not increase in time, ọ is called total variation 


diminishing or TVD. It follows from the exact solution (9.2) that, writing 
£=z£-—ct 
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TV(p(t,-)) = Himsup =f |p(-2') - o(-2' - eae’ (9.18) 


is independent of t, so p is TVD. It seems desirable that the scheme (9.3) 
mimicks this property. The total variation of the numerical solution is defined 
as 


TV(p")= So le? -øl (9.19) 
J=- 


where we assume that the sum exists. A scheme is called TVD if TV(y”) 
does not increase with n, for all {7}. 


Theorem 9.2.5. TVD property 
The scheme (9.4) is TVD if and only if 


ym >0, Wm, and Som <1. (9.20) 


Proof. It suffices to consider the case n = 0. 


Sufficiency. We have 


0 0 
TV) = YIX mlam -Html YOY hml jtm — Pmi 
jm jm 


Yom X. lee veil < TV(y"). 


k=-—o0 


Necessity. 

Let {p9} be monotone. Assume ym = 0 for |m| > M. Choose 
pS = g- = constant, J < -3M , 
y; = monotone, -3M <j < 3M, (9.21) 
pS = œ+ = constant, J> 3M. 


Assume that the resulting numerical solution {yj} is not monotone. Then 
there is at least one local maximum yj and minimum ¢},, and 


TV(~*) > los — p-lt lok- Pml > lp- 9-1 = TV (9°), 


which is a contradiction. Hence, {y}} is monotone. Because the support of the 
non-constant part of p° is larger than that of {+}, the initial condition (9.21) 
is sufficiently general to invoke Theorem 9.2.1, which proves the necessity of 
the first part of (9.20). Continuing with the second part, assume ym > 0 
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and Y` ym > 1. Then the scheme is monotonicity preserving. Choose p? non- 
m 


decreasing. Then y! is non-decreasing, and we may write 


TV(¢’) = ds  — Pj- 1) = 2 Dam (Y$4m — Ptm- 1) 


= Lie x l — pei) = So mTV(¢") > TV(¢°) , 


k=—-co 


so that we have arrived at a contradiction, showing necessity of the second 
part of (9.20). o 


We see that a TVD scheme is monotonicity preserving, but not vice-versa. 
The difference lies in the second condition of (9.20): 5° ym < 1. Since consis- 


m 
tency requires this condition anyway (cf. (9.10)), for practical purposes TVD 
and monotonicity preservation are equivalent in one dimension. But in more 
dimensions the gap between the two properties is wider, as we shall see. The 
concept of TVD solutions and TVD numerical schemes plays an important 
role in the theory of hyperbolic conservation laws. 


Exercise 9.2.1. Prove (9.10). 


Exercise 9.2.2. Alternative proof of Godunov’s order barrier theorem (Roe 
(1986)). 

Show that if the scheme (9.4) is exact when the exact solution is a second 
order polynomial, then roti as defined by (9.9) satisfies roth = = Ø(T? +h). 
Show that in that case (using (9.10)) and if the scheme is also monotonicity 
preserving, then 


dom)? X (Mam)? = {Do ma}, 


where [fm = „m. Show that this violates the Cauchy-Schwarz inequality. 
Hint: if a,b € R? then |a- b| < |/ajj - ||b||, with equality only if a and b are 
parallel, or a= 0 or b= 0. 


9.3 Linear schemes 
Over the years, a great many schemes have been proposed, some of which 


are better than others, of course. It turns out that many schemes that have 
survived until today can be interpreted as members of a one-parameter family 
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of schemes, namely the «-scheme, introduced by van Leer (1977). We already 
encountered the «-scheme in Sect. 4.8 


A classical way to discretize in time is due to Lax and Wendroff (1960), as 
follows. Using (9.1), we have 


1 
p(t +7) = p(t, z) + rpelt, e) + 57’ pelt, 2) + O(r*) 
9.22) 
= p(t, x) — ere (t, £) + sor peel x)+O(r°), 


where p; = Op/0t, pe = Oy/O0z etc. As in the preceding section, c is assumed 
constant. Approximation of Ys with the «-scheme of Sect. 4.8 and pyr with 
a weighted average of 3-point differences centered at x; and z;—ı for c > 0 
and at x; and 2;4, for c < 0 results in, using a uniform spatial grid with 
step-size h, 


n n 1 n n n n 
gett = p7- zra — K)pF_g + (3K — 5)pj-1 + (38 — 8K) 97 + (1+ s)oj+} 
l n n n n n 
+ 57 {HEF 1 — 2p; + vF 41) + (1- B)(Pj—2 — 293-1 + £5 )},0>0, 
2 
(9.23° 
n n 1 n n n n 
gett = gf t+ z710 + K)pF_1 + (3 — 8K) pF + (BK — 5) + (1 — shejt 
1 n n n n n n 
+ 57 {eleja — 297 + piy) + 1 — ele - 26741 t P aO, 


where ø = cr/h is the signed CFL number. Scheme (9.23) can be rewritten 
as 


n n T n 
gett = p + 7“ — 2po — 1 + 20) pF 2 
o 


n T n 
+ 1 (-3K + buo +5 — 4o)pe y+ z“ — bpo — 3 + 20) 9% 


O 
+ Glow tuo- Nes, 720, 


n n T n o n 
PFT = pp t Z(t no + 1)pj-i + Z (3K — bus + 3 + 20) yp 
o n o n 

+ 78K + uo — 5- 40)pjyi + g(s 2Ha + 1+ 2e) pi 42 | <l. 


(9.24) 


We see that effectively there is only one free parameter, namely « — 2uļol. 
Defining 


-i,3 K|o| 9.25 
Haat 5k o|, (9.25) 


the x-scheme as formulated by van Leer (1977) is recovered: 
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o 


pjt = oF + 7 (-1 +6 + (1— 3K)o + 2x0") oF 

+ = (5 — 3K + (9x — l)o — 6x0") pF, 

4 q(-3 + 3K — (1 + 9K)o + 6Ko*)y? 

+ F(-1— m+ (1+ 3n)o 2x07) ph, , >D, (9.26) 
ett = gh + T(L+ e+ (1+ 3e)o + 2K0?)g a | 

+ 703 - 3K — (1 + 9K)o — 6Ko") yp? 

+ F(-5 43m — (1-9n)o + 6x07) oF 44 

+ FU- «+ (1—3K)o = no") pF 40, a<Q. 


By Taylor expansion and by using the differential equation to replace time 
derivatives by space derivatives we find for the local truncation error (dividing 
(9.26) by T to make the scheme approximate the differential equation): 


d (nr, jh) 


ðr? 
4 jh 
+ ch {To — K) + (21K — 8)? — 14x07 + ot eenn jh) plnr, J 


1 
r = Tgh (3K — 1)(1 — 37r + 20°) 


which shows that the x-scheme has third order accuracy for x = 1/3 and 
second order accuracy otherwise. 


The classical scheme of Lax and Wendroff (1960) is obtained by substituting 
u = K = 1 in (9.24) (hence (9.25) is not valid). We obtain, both for positive 
and negative c: 


gett = AG +0)pj-1 +(1-0°)p} + z —o)PF41- (9.28) 


By Taylor expansion we find for the local truncation error: 


1 


r”? = —ch?(1 _ 92) Zelar, jh) 
7? 6 


aga + O(h’). (9.29) 


For « = 1/3, equation (9.26) gives the QUICKEST scheme proposed by 
Leonard (1979). As noted before, this value of x gives the smallest trunca- 
tion error. Note that the QUICK scheme, proposed in Leonard (1979) for the 
stationary case, corresponds to «x = 1/2. 


For «x = 0, (9.26) gives the scheme proposed by Fromm (1968). It is designed 
to have a small phase error. In Wesseling (1973) it is shown that among all 
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linear schemes that share the stencil of scheme (9.24), Fromm’s scheme is 
optimal when the exact solution is a propagating step-function. 


The fully one-sided upwind scheme proposed by Warming and Beam (1976) 
is obtained from (9.26) by choosing « — 24o = —1. 


With the Lax-Wendroff approach, temporal discretization takes place by re- 
placing the time derivatives in (9.22) by space derivatives, using the differ- 
ential equation (9.1). When a source term is present, or even more so when 
there are other differential terms, such as a diffusion term, this procedure be- 
comes less straightforward. Another disadvantage of this way to do temporal 
discretization occurs when the sole aim of the computation is to compute a 
steady state. When steady state is reached, gett = p7}, and equation (9.24) 
shows that the final solution depends on the time step 7 through ø. This is 
unsatisfactory. These disadvantages are avoided by discretizing first only in 
space, as in Chap. 5. Using the «-scheme, this results in the following system 
of ordinary differential equations: 


c>0: 

dpj/dt = —3{(1— )pj—2 + (84 — 5)pj-1 + (8 — 3K) yj + (1+ 6) esti}, 
e<0: 

dpj/dt = Ri + &)pj-1 + (3 — 8x) 9; + (86 — d)pj41 + (1 — K) yj 42} - 


(9.30) 


Next, a discretization method for ordinary differential equations is applied. 
Examples have been given in Chap. 5. 


Further schemes that should be mentioned are: 


the first order upwind scheme: 
l 
dej/dt = -z-le + lel)p;-1 + 2e; — (e— lep}, (9.31) 
the fourth order central scheme (cf. Sect. 4.8): 
1 
dp; /dt = -T57 {pj-2 ~ 89j-1 + BY j41 — Pit}, 
and the Lax-Friedrichs scheme (Lax (1954)): 


gett = 5 (Pj~1 + Pi4i) ~ 9 Pj+ - 95-1): (9.32) 


With forward Euler temporal discretization, (9.31) goes over in the Courant- 
Isaacson-Rees scheme (Courant, Isaacson, and Rees (1952)): 


n n 1 n n n 
PF = g} -aile + loej- +t 2loles - lo- lole} (9.33) 
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These schemes do not belong to the class of «-schemes. The first order up- 
wind and Lax-Friedrichs schemes are first order accurate. It is easy to see 
that the Lax-Friedrichs and Courant-Isaacson-Rees schemes are monotonic- 
ity preserving for |o| < 1. This is also the case for the first order upwind 
scheme, for suitable temporal discretizations. 


Boundary conditions 
Consider the same equation as before, but now on a bounded domain: 
OF |, OF 4, T>0, 2€(0,1), c= constant. (9.34) 
£ 


Recalling the discussion of characteristics and boundary conditions in Sect. 8.2, 
we see that equation (9.34) has characteristics parametrized by 


di i dz 

de’ ds 
Assuming that an initial condition is given for (0, x), a boundary condition 
is required for y(t, 0) if c > 0 and for y(t, 1) if c < 0. The same is true for the 
first order upwind scheme. However, all other schemes discussed before need 
additional artificial boundary conditions, because the stencil of the spatial 
discretization contains three or more points. Assume c > 0, and let the grid 
points be z; = jh, j = 0,1,...,J, h = 1/J. In (9.30), 9; (0) is given by the 
initial condition, and yo({t) is specified by the boundary condition. In order to 
complete the equation for dy, /dt, we need an artificial boundary condition 
to specify y_,(¢). Similarly, to complete the equation for dy, /dt, an artificial 
boundary condition is needed for y441(t). In the theoretical literature, the 
most common way to generate artificial boundary conditions is to extrapolate. 
For example, linear extrapolation gives 


p-1(t) = 2p0(t) — vilt), pati (t) = 293 (t) — pJ-1(t) . (9.36) 


Another option is to avoid the use of artificial boundary conditions by us- 
ing the first order upwind scheme for j = 1, and a fully one-sided upwind 
scheme (first order, or x = —1 or the Beam-Warming scheme) at j = J. 
In the present simple case, we found no significant differences between these 
approaches in numerical experiments. 


=c. (9.35) 


Gustafsson (1975) has shown that the order of accuracy of the artificial 
boundary conditions can be one order lower than for the interior scheme, 
without impairing the overall order of accuracy. With (9.36), scheme (9.30) 
at the boundary becomes (c > 0): 


C 


dpı/dt = -7z i(w — 3)po + (2 — 2K)p1 + (1 + r)p2} 
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It is left to the reader to show that 
dy, /dt = ~cOp(t,0)/Ax + O(h?) 


so that the artificial boundary condition (9.36) is sufficiently accurate, even 
for x = 1/3, in which case the discretization error in (9.30) is O(h?) in the 
interior. It is of further interest to note that Kreiss and Lundqvist (1968) 
have shown that if the numerical scheme is dissipative (a concept that will 
be discussed shortly), then the influence of artificial boundary conditions is 
limited to an interval of length O(A| ln Al). 


As we saw in Sect. 5.9, artificial boundary conditions at an outflow boundary 
may give rise to spurious reflection. However, in the present simple case spu- 
rious reflection did not show up in the numerical experiments to be described 
later. Spurious reflection becomes an issue when other terms are added (e.g. 
diffusion), or when systems of equations are considered. 


Stability 


Stability analysis for the mixed initial-boundary value value problem (9.34) 
is presented by Richtmyer and Morton (1967) and Gustafsson, Kreiss, and 
Sundström (1972). This turns out to be a rather complicated affair, that we 
will not discuss. It is necessary that the scheme is stable for the pure initial 
value problem: 


a + oS, =! t>0, -w<r<cn. 


Stability for the pure initial value problem can be analysed (in the case of con- 
stant coefficients) with the Fourier analysis method introduced in Sect. 5.5, 
and we will do this here. In Sect. 5.5, the domain is finite with periodic bound- 
ary conditions, and grid functions can be represented by the finite Fourier 
series (5.20). In the present case of a pure initial value problem, we have 
an infinite uniform grid, and grid functions can be represented by a Fourier 
integral: 


Theorem 9.3.1. Fourier integral 
If {p;: p; E€ R, j = 0, +1, +2,...} satisfies 


0a 
Dee} <o, 


j=- 


then there exists a function (80) : (—r, n) —> R such that 


T 


Ay ij , 1 € is 
Pj = | (6) dd, g(8) = In ` pje. (9.37) 


-T j=- 00 
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Proof. See Sect. 9.2 of Dahlquist and Bjorck (1974). o 


As in Sect. 5.6, we study the behavior of a perturbation ô; (t) caused by a 
perturbation ó; (0) of the initial condition. Substitution in (9.30) gives 


dô; /dt + Lað; =0, 


where L, follows from comparison with (9.30). Substitution of (9.37) gives 


J {di(t, @)/dt + En (0)4(t, 0) }e8do = 0 , (9.38) 


ix 
where, as in Sect. 5.5, L, is the symbol of La, defined by 
Ly (0) = e Lp et? , 
From (9.38) it follows that 
dé(t,0)/dt + L,(0)d(t,6) =0, 
and, similar to Sect. 5.6, we have stability, i.e. |d(¢,@)| < |6(0, 0)|, provided 
-Êr ES, —-r<O<7n, 


with S the stability domain of the temporal discretization method used to 
approximate (9.30). The symbol £, (8) follows from (5.34) by taking the one- 
dimensional case, replacing cı by o and putting the viscosity D = 0, or 
dı = 0. The same substitutions transform the stability conditions derived 
for a number of temporal discretization methods in Sect. 5.7 and 5.8 into 
stability conditions for the initial value problem (9.30). 


In a similar way, stability conditions can be derived for the Lax-Wendroff 
temporal discretizations for the «-scheme (9.24) or (9.26). 


We substitute 


and find 


We have stability if 


lg(A)| <1, VO. (9.39) 
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Theorem 9.3.2. Stability of Lax-Wendroff scheme 
A necessary and sufficient stability condition for the Laz- Wendroff scheme is 


lol <1. (9.40) 
Proof. The Lax-Wendroff scheme is given by (9.28). We find for all ø: 
g(0) = 1 — @?°(1 — cos 0) ~ io sinb . (9.41) 


By taking 0 = m we see that (9.40) is necessary for (9.39). Furthermore, 
writing s = sin? 36, we have 


lgl? = 14+ 4s%07(o? —1) <1 (9.42) 
where we have used (9.40). Hence, (9.40) is sufficient for (9.39). a 


Theorem 9.3.3. Stability of k-scheme 
A necessary and sufficient condition for the k-scheme (9.26) to be stable for 
-l<K«<lis 


lo] <1. 


Proof. For o > 0 the amplification factor is given by 


g(0) = 1— 207s + 2as? + i(as—c)sin6, s= sin? 36, (9.43) 
a = o(o—1)}(l—K+ 2k). 

We find 

Ig}? = 1+ 4s?(a — 2ac — o° + a4) + 4as? (a + 20 — 20°) . (9.44) 
Since 0 < s < 1, |g]? < 1 for all 6 is equivalent with 

a — lac — 0? +04 + as(a + 20 — o°) <0, O<s<l, 
which is equivalent with 
a — ĉ2ao — 0o? +0 <0 and a(a+20—07) <0. (9.45) 

We start with sufficiency. We have 

a — laa — o? +04 = a(o — 1){1 -0 +67 — k(l- 20)*}. (9.46) 


From —1 < «x < 1l and 0 < ø < 1 it follows that 
l-o+o0*—«(1- 20) > 1—-o +0? — (1 — 20)? = 3(o — 0?) > 0, (9.47) 


establishing the first part of (9.45). Continuing with the second part, we have 
a < l, since for —1 < x < 1 the extrema of 1—«+2k«e for fixed c are attained 
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for « = +1, and both are non-negative. It remains to show that in (9.45) we 
have a + 20 — o° > 0. We find, using 0 < o < 1, 

a +20 -o° =a(o—1)(2h0 —-1—K)>0, 


since the extrema of the last factor for o fixed and —1 < « < 1 are attained 
for « = +1, and both are non-positive. 


Necessity is shown as follows. From (9.45) it follows that is necessary that 
a—2ac—o? +o° <0. 
We have 
a — 2?ac ~ o? + 0% = o(a — 1){1 — k + (4k — lo +07(1 —«)} . 


The term between braces is positive for —1 < « < 1/4, Vo and changes sign 
forc > 0 at 


n = Zq +v3/(4k-1)), «>1/4. 


Since a; > 1 for « < 1 the scheme is unstable for 1 < e < c1. It might be that 
the scheme is stable for ¢ > c1, but this does not need to be investigated, 
because a scheme that becomes instable if the time step 7 (and hence a) is 
decreased is impractical. For the case s = 1 we look at the second part of 
(9.45). We find 


a(a + 20 — o°) = 20° (a — 1)(20? — 30 + 2) , 


which is of wrong sign for o > 1. This completes the proof for ø > 0. The 
case g < 0 is left to the reader. o 


Theorem 9.3.4. Stability of Laxr-Friedrichs scheme 
A necessary and sufficient condition for the Laz-Friedrichs scheme (9.32) to 
be stable is 


lol<1. 


The proof is left to the reader. 


Theorem 9.3.5. Stability of Courant-Isaacson-Rees scheme 
A necessary and sufficient condition for the Courant-Isaacson-Rees scheme 


(9.33) to be stable is 


lol <1. 
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The proof is left to the reader. 


We can now state what a dissipative scheme is, a concept that was referred 
to earlier. Kreiss (1964) gives the following definition: 


Definition 9.3.1. Dissipative scheme 
A discretization scheme is dissipative of order 2r,r a non-negative integer, if 
there exists a constant 6 > 0 such that 


IO < 1-80, -r<0<r. (9.48) 


We show that the Lax-Wendroff and «-schemes are dissipative. For the Lax- 
Wendroff scheme we have, from (9.42): 


gl? < 1- 7640?(1 o°). (9.49) 


Since, obviously, we want T to be greater than zero, it is reasonable to assume 
g? > e£ >Q. If also a? < 1 — e then (9.49) gives 


l 
lgl? < 1- 66%, s= zele), if e<o?<l-e, 


so that the Lax-Wendroff scheme is dissipative of order 4. For the x-scheme, 
(9.44) gives, assuming 0 < c < 1, 


gl? < 14 4s*(a — 2a0 — o* +0), 


since a(a + 20 — 207) < 0, as shown in the proof of Theorem 9.3.3. From 
(9.46) and (9.47) it follows that 


ll? <1- o%0%(1 —o*), 
hence 
lg? < 1- 66%, 5 = Že(l —e), if e<o?<1l-e, 
so that the x-scheme is also dissipative of order 4. 


An advantage of dissipative schemes has been mentioned before, namely that 
the influence of artificial inflow boundary conditions is limited to a layer of 
thickness O(h|ln A|). Another advantage is discussed by Richtmyer and Mor- 
ton (1967) Sect. 5.4, namely that we have stability in the variable coefficients 
case. 


It is left to the reader to show that with o = 1 the Lax-Wendroff and x- 
schemes reproduce the exact solution. But for @ = 1 the schemes are not 
dissipative. Nevertheless, artificial inflow boundary conditions have no in- 
fluence. In fact, it turns out that a weaker condition than dissipativity is 
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sufficient to restrain the influence of artificial boundary conditions; Kreiss 
and Lundqvist (1968) show that it is sufficient if the scheme is contractive: 


IgM) <1, |g(@+ia)|<e~?, O<, 


for some a > 0 and a constant 8 > 0. For general e, to prove contractivity 
is more difficult than stability, but the case |ø| = 1 is easy and is left to the 
reader. 


Numerical dissipation and dispersion 


The accuracy with which a numerical scheme approximates the propagation 
properties of wave-like solutions of hyperbolic equations depends on the dis- 
sipation and dispersion of the scheme, concepts that we encountered earlier, 
namely in Sect. 8.2. 


Postulating a harmonic wave p = gexp{i(kz — wt)} for the solution of (9.1) 
we find the following dispersion relation: 


wock. 


We recall from Sect. 8.2 that the phase velocity and group velocities are 
defined by 


Cp = Re(w)/k, cg = dRe(w)/dk . 
Hence, in the present case we have 
Cp = Cg =C. 


so that there is no dispersion: c, does not depend on k. Assuming c > 0, 
substitution of a harmonic wave y? = pexp{?(jkh — wat)} in a difference 
scheme on a uniform grid with step-size h gives a relation of the form 


exp(—twnT) = g(8), O=kh. (9.50) 
For example, for the Lax-Wendroff scheme (9.28), g is given by (9.41), 
whereas for the «-scheme, g is given by (9.43). It follows from (9.50) that 


a 
Wh = -lng , 
T 


so that the numerical phase and group velocities are given by 


li ld 
A h g 
Cp = —Im{ = lng} P Cg = lm zak . (9.51) 
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In general, numerical schemes will have dispersion. The fidelity of the scheme 
in reproducing the propagation properties of the exact solution can be as- 


sessed by comparing c? and cp, and cj and cg. This can also be done by 


comparing the numerical amplification factor g(@) with the exact amplifica- 
tion factor ge(0) over one time step, given by 
ge (9) — eet = eT tckT — eta , o= er/h ; 


The numerical dispersion of a wave with wavenumber k is defined as, writing 
g = |g\exp(iarg(g)) , 


a(6) = arg(ge) — arg(g) - 


The numerical dispersion has the following physical interpretation. Since 
arg(ge) = —ckr and Ing = In|g| + targ(g) , we see from (9.51) that 
arg(g) = —cpkr, so that 

a(8) = (eh —c)kr= (ci —ce)r2ar/L, 


where L is the wavelength, so that a(6) measures 27 times the distance that 
the numerical wave travels too far in one time step, divided by the wavelength. 


The numerical dissipation is defined by 
(0) = |ge| — |g! . 


This measures the amount by which the numerical scheme reduces the am- 
plitude too much over one time step. In the present case, |ge| = 1. 


For the Lax-Wendroff scheme (9.28) we find from (9.41): 
arg(g) = —arctan[osin6/{1 — 07(1 — cos 8) }} . 
Furthermore, 
arg(ge) = —08 . 
Hence 
a(@) = arctan[o sin(6)/{1 — a° (1 — cos 0) }] — 06. 
From (9.42) we find 


(6) = 1 — {1 + 4870? (o? —1)}"/?, s= sin’? x0 . 


For the x-scheme (9.26) we find from (9.43) and (9.44) 


a(b) = arctan{(o — as) sin6/(1 — 207s + 2as*)} — o0 , (9.52) 
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(0) = 1 — {1 + 4s" (a — 2a0 — o° + o4) + 4as? (a + 20 — a?)}!/?, (9.53) 
where a = o (o — 1) (1 — k + 2ko) . 


We see that both the Lax-Wendroff and the x-scheme contain dispersion as 
well as dissipation. The dispersion ¢(0) and dissipation a(@) for these schemes 
are plotted in Figs. 9.1-9.4. Comparison shows that the «-scheme has less 





a 02 0.4 0.6 0.8 1 


Fig. 9.1. Dispersion (left) and dissipation (right) versus @/m for the Lax-Wendroff 
scheme. —- : a = 0.3; - - - : o = 0.6; —--~-— :o =0.9. 


dispersion than the Lax-Wendroff scheme. The x = 0 scheme was designed 
by Fromm (1968) for low dispersion, and indeed it comes out best in the 
dispersion plots. The x = 1/3 scheme has third order spatial accuracy (cf. 
(9.27)), and has less damping than the other «-schemes, Numerical damping 
is not all bad; in fact, it is desirable that modes that propagate at wrong 
phase velocity are damped. Therefore it is favorable that the schemes con- 
sidered here have significant damping for values of 6 for which dispersion is 
significant. 


Numerical experiments 


To see what happens in practice, the schemes discussed are applied to (9.34) 
with c = 1. The exact solution is p(t, £) = f(z — t). For f we choose 


f(z) =0, z ¢(-0.8,-0.1), f(z) =1, 2 € [-0.3, —0.1), (9.54) 
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Fig. 9.2. Dispersion (left) and dissipation (right) versus 9/m for the x-scheme, for 
o0=—0.3.---:«K =-1,—-:«K =0;----- cK = 1/3. 


or for x € [—0.3, —0.1], 
f(z) = 1 — cos? 5r (x + 0.3). (9.55) 


This exact solution is a pulse that travels with velocity e in the positive z- 
direction. The pulse enters the domain at t = 0.1, and has left at ¢ = 1.3. 
In the case of (9.55) the pulse is twice differentiable, whereas in the case of 
(9.54) it consists of two step functions. We take the mesh size h = 1/30, 
so that the support of the pulse equals 6h. For the CFL number we choose 
a = ch/r = 0.7. Results for the Lax-Wendroff scheme (9.28) are given in 
Fig. 9.5 and for the «-scheme (9.26) in Figs. 9.6 and 9.7. The pulse is shown 
after entering, after traveling through most of the domain and when leaving. 
The exact solution is represented by drawn curves, the numerical solution by 
circles. In both cases, we found almost no difference between the two ways to 
generate artificial boundary conditions that were discussed, and results are 
shown only for linear extrapolation (9.36). Spurious reflection from the out- 
flow boundary appears to be absent. The Lax-Wendrofff scheme lets the pulse 
(especially the smooth one) travel too slowly, as predicted by the dispersion 
plot (Fig. 9.1). As is to be expected because of Godunov’s order barrier the- 
orem, both schemes show spurious wiggles, but these are much more severe 
for the Lax-Wendroff scheme than for the «-scheme. Although both schemes 
are second order accurate, the x = 0 scheme is significantly more accurate 
for the smooth pulse than the Lax-Wendroff scheme, for which not only 
the propagation velocity is too small but which also has more dissipation, 
as predicted by the dissipation plots in Figs. 9.1-9.4. Because of its third 
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Fig. 9.3. Dispersion (left) and dissipation (right) versus @/m for the x-scheme, for 
a = 0.6. ---: k= -l;—:k= 0; : K = 1/3. 


order accuracy, the x = 1/3 scheme is slightly more accurate than the x = 0 
scheme. For the step function pulse the exact solution is not smooth enough 
for the formal accuracy of the «-schemes to pay off, althought they maintain 
their advantage of smaller spurious wiggles. 


Often, spurious wiggles cannot be tolerated at all. Measures to obtain second 
order accurate wiggle-free schemes, such as flux limiting, were discussed in 
Sect. 4.8 for the stationary convection-diffusion equation. Before discussing 
this we abandon the linear case and consider the nonlinear case. 


Exercise 9.3.1. Perhaps the simplest scheme that one can think of for the 
differential equation Op/dt + cðp/ðz = 0 is 


1 
OP 9h + 50(¥j 41 =, o= erh. 


Show that this scheme is unstable by means of Fourier stability analysis. 


Exercise 9.3.2. Show that for |o| = 1 the Lax-Wendroff and «-schemes are 
contractive. 


Exercise 9.3.3. Prove theorems 9.3.4 and 9.3.5. 


Exercise 9.3.4. Prove that the Lax-Friedrichs and Courant-Isaacson-Rees 
schemes are dissipative. 
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Fig. 9.4. Dispersion (left) and dissipation (right) versus 6/7 for the «-scheme, for 
o0=0.9.---:K =l; —:kr=0 : k = 1/3. 


9.4 Scalar conservation laws 


The nonlinear generalization of (9.1) is 


ag | ary) 


with y a scalar (which in this context merely means single) function; this is 
why this is called a scalar conservation law. For the time being, boundary 


conditions will not be considered, which is why the domain is taken to be 
unbounded in z. An initial condition is required: 


(0,2) = p° (2). (9.57) 
Integration of (9.56) from z = a to z = b gives 


b 


d 
< | ede = flp). 


a 


This shows that the total amount of ọ in (a, b) can change only by transport 


or flux through z = a and z = b; f is called the flux function. If there is 
b 


no transport, f pdz is constant or conserved, which is why (9.56) is called a 


a 
conservation law. 
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Fig. 9.5. Numerical results for the Lax-Wendroff scheme. 
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Fig. 9.6. Numerical results for « = 1/3 scheme. 
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Because almost all (if not all) successful numerical methods for computing 
inviscid compressible flows are based on methods for scalar conservation laws 
in one dimension, and because there is a well-developed theory mainly for 
this case only, it is worthwhile to study properties of and numerical methods 
for scalar conservation laws, which we will do in this section. Despite the 
apparent simplicity of (9.56), this is a large subject, and the reader may wish 
to consult more extensive introductions to the subject. A good starting point 
is LeVeque (1992). Classics in the field are Courant and Friedrichs (1949) and 
Lax (1973). Recent expositions of the mathematical theory are Godlewski and 
Raviart (1996) and Smoller (1983). More emphasis on numerics is found in 


Sod (1985). 
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Fig. 9.7. Numerical results for x = 0 scheme. 


Characteristics 


If f is differentiable, equation (9.56) can be rewritten as 


dp Of _ _ df 
This shows that y is constant along characteristic curves defined by 
dt dz 
t=t(s), x=a2(s), ist? qg T). 


Since y and hence c(y) is constant along characteristics, the characteristics 
are straight. Depending on the initial condition p°(¢), it may easily happen 
that characteristics intersect. This would make the solution multi-valued, 
which is physically impossible. What happens is that a discontinuity, called a 
shock, develops. Even for smooth initial data y°(t) the solution may become 
discontinuous in finite time, as illustrated in the following example. 


Example 9.4.1. Inviscid Burgers equation 


The Burgers equation (Burgers (1948)) is given by 
de fle) _ ay 1 a 
att ade dx?’ Hy) = QP: 

For v = 0 we obtain the inviscid Burgers equation, which is the simplest 

example of a nonlinear scalar conservation law. The initial condition is chosen 

as follows: 


(9.58) 


y(x4)=1, r<0; gz) =0, z>1; y®EC™(R). 
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We leave y? unspecified for 0 < x < 1, except that we stipulate that °? is 
infinitely differentiable and monotone. Characteristics are plotted in Fig. 9.8. 
Independent of the specification of g? for 0 < z < 1, the characteristics cer- 


1 


Fig. 9.8. Characteristics for inviscid Burgers equation. 


tainly intersect, for t > 1 at the latest. Hence, although the initial condition 
p°? € C™, a discontinuity develops in finite time. o 


Weak solutions and shocks 


As said before, a physically relevant solution is single-valued, and where char- 
acteristics intersect the solution becomes discontinuous. Since classical solu- 
tions of (9.56) are differentiable, the class of solutions has to be extended. 


Definition 9.4.1. Weak solution 
A bounded integrable function p(x,t) is called a weak solution of the initial 
value problem (9.56), (9.57) if 


J J Cote + Hove dadt + f 0a) = 0 (9.59) 


0 -œ 
for all y € CA(Rt x R). 


Here C4 (92) is the space of functions that are continuously differentiable, and 
that are identically zero outside some bounded set inside 2. Note that (9.59) 
makes sense even if the solution ọ is not continuous. It is left to the reader 
to show that if a weak solution is in C4, then it is a classical solution. 


We will now study the jump condition that holds at a discontinuity. Let 
x = y(t) be a curve I’ along which there is a discontinuity, and let 
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t)= lim py(é,r), r(t) = lim p(t, z). 
gi(t) jim. Pl )+ Pr(t) = lim pf ) 


Let p be differentiable away from z = y(t). Let P = (to, y(to)) be a point 
on I’, and let D be a neighbourhood of P. Choose y € C9(D). Let D; and 
D, be the parts of D that are to the left and to the right of I, respectively. 
Since y is differentiable in D; and D,, we can apply the divergence theorem, 
and (9.59) gives 


o= f [tov + tedeat=(f + f yo(-pde + far). 
D aD, OD, 
Since 7 = 0 on OD, the only contributions come from IP: 
Po 
o= f wlplde —[F(p)]dt) , 
Py 


where Pı and P; are the intersection points of I and D, and 


[yp] = vr (t)— vit), e)l = Fler) — Fler) - 


Since 7 is arbitrary, we must have 


sly] =[f(y)], (9.60) 


where s = dy/dt is the propagation speed of the discontinuity. Such a dis- 
continuity is called a shock. Equation (9.60) is called the jump condition. It 
determines the location and the size of the discontinuity, and a single-valued 
weak solution is obtained, as illustrated in the following example, due to Lax 
(1973). 


Example 9.4.2. Solution of inviscid Burgers equation 
Consider the inviscid Burgers equation (9.58) with the following initial con- 
dition: 
e'(r)=1, «<0; yr) =1—-2z, 0<2<1; gz) =0, x>1. 

The characteristics intersect for ¢ > 1. It is left to the reader to show that 
for ¢ < 1 the solution is given by 

p(lt,z)=1, <t; p(t,z)=(1-—z)/{1-t), t<e<1; 9.61 
p(t,r)=0, z>1. (9.61) 


For t > 1 we have a shock. Clearly, yı = 1 and y, = 0, so that the jump 
condition (9.60) gives s = 1/2. The shock starts at (t, £) = (1,1), so that for 
t > 1 a solution is given by 


1 
plt,z)=1, z<i+z5@ë-1); plt,z)=0, z>1+5@-1). 


The characteristics and shock are depicted in Fig. 9.9. o 
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Fig. 9.9. Characteristics and shock for Example 9.4.2 


The entropy condition and uniqueness 


It turns out that the weak formulation (9.59) together with the jump condi- 
tion (9.60) are not sufficient to determine a unique solution, as illustrated by 
the following example, again due to Lax (1973). 


Example 9.4.3. Multiple solutions of inviscid Burgers equation 
Consider the inviscid Burgers equation (9.58) with initial condition 


y'(2r)=0, 2<0; g(r) =1, z>0. 
By using the characteristics we see that 
plt,z)=0, z<0; p(t,z)=1, z>t. 


But what is the solution for 0 < z < t? A solution satisfying the jump 
condition is 


p(t,z)=0, z<t/2; pít,z)=1, z>t/2. (9.62) 
But another possible solution, not containing a shock, is given by 


p(t,z)=0, «<0; pļlt,xz)=z/t, 0<2<t; 


p(i,2)=1, z>t, (9.63) 


and there may be more solutions. Anyway, two is too many! o 
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The question arises what additional condition should be imposed to make 
weak solutions unique. In fluid mechanics, an inviscid mathematical model 
is the vanishing viscosity limit of a viscous model. Analogously, in the case 
of the inviscid Burgers equation we require that the weak solution should be 
the v | 0 limit of a classical solution of the (viscous) Burgers equation. This, 
and other equivalent conditions, all go by the name of entropy condition. This 
name derives from the fact, that in the presence of viscosity (however small) 
a real physical flow causes the entropy of a fluid particle to be non-decreasing. 


What is now needed is a mathematical formulation of the entropy condition 
that makes it (relatively) easy to check whether it is satisfied or not by a 
weak solution. One such formulation, due to Oleinik (1957), is the following: 


Definition 9.4.2. Entropy condition 
A weak solution is said to satisfy the entropy condition if in every point of 
every shock we have 


— (i) if pr < pı: The graph of y = f(y) over [pr, pi] lies below the chord 


connecting (pr, f(yr)) to (u, F(¢t)); 
— (ii) if pr > yi: The graph of y = f(y) lies above the chord. 


A weak solution that satisfies the entropy condition is called an entropy so- 
lution. Note that the entropy condition says something about shocks only. 


What does Definition 9.4.2 have to do with entropy? Suppose that f is con- 
vex: f° > 0, as is the case, for example, for the inviscid Burgers equation. 
According to the jump condition (9.60) the shock speed equals the slope of 
the chord. Condition (i) is in the case f > 0 equivalent with 


elyr)<s<e(yi), ely) =f (p), (9.64) 


and this is obviously satisfied if f > 0. If p, > pı we have for f >0 


e(r) > s > ely), 


and this is forbidden by condition (ii). Hence, in the case of convex f the 
entropy condition says that characteristics never emanate from a shock but 
always enter it. This means that from every point on a shock we can travel 
along characteristics backward in time only and not forward. In other words, 
information reaches the shock from the past only and not from the future. 
One may therefore regard Definition 9.4.2 as a causality principle. By distin- 
guishing between past and future this entropy condition says something about 
the arrow of time: it implies time-irreversibility. Irreversibility is intimately 
related to the second law of thermodynamics, saying that entropy does not 
decrease in a closed system. Hence the connection between Definition 9.4.2 
and entropy. 
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There are several equivalent formulations of the entropy condition, see Lax 
(1973) or Smoller (1983). The entropy condition guarantees that a weak so- 
lution is a vanishing viscosity solution and is unique, but we will not prove 
this. 


It is easily seen that the entropy condition tells us how to choose between the 
two solutions (9.62) and (9.63) of the inviscid Burgers equation. In this case 
f > 0, and (9.64) should be satisfied by the discontinous solution (9.62), but 
it is not; hence, (9.62) is not an entropy solution. Since according to (9.63) 
y is continuous, it is automatically an entropy solution; as asserted above, it 
is unique. 


Non-equivalence of conservation laws 


Two conservation laws that are equivalent for smooth solutions may not be 
equivalent for weak solutions. For example, the two conservation laws 


l 2 
Pt + z(e )e =0 and (p°) + S(p’) —0 


are obviously equivalent for differentiable ». But the respective jump condi- 
tions are 


sle] = slo] and sly") = Ziy], 


leading to the following shock propagation velocities, respectively: 


1 2. 
s=5(yrt yr) and s= Slept prp t yr)/(er + 41): 


which are obviously different. In physical applications it is clear which form 
of a conservation law should be taken: the form that follows directly from the 
basic integral form of the physical conservation law. 


Total variation and monotonicity 


The solution of the conservation law (9.56) is total variation diminishing 
(TVD) and monotonicity preserving. It is desirable that numerical schemes 
mimick these (almost equivalent) properties, which we therefore discuss. 


The total variation of a function y(t, z} is defined in equation (9.17). 


Theorem 9.4.1. Uniqueness, TVD and inverse monotonicity 

Assume that y? € L®(R) and let f(y) be differentiable. Then the problem 
(9.56), (9.57) has a unique entropy solution p E€ L® (R+ x R). This solution 
satisfies for almost allt > 0 


9.4. Scalar conservation laws 369 


elt, Mize < lle? - 


If p and y are entropy solutions associated with the initial conditions y® and 
v°, respectively, then we have 


p> wv = v(t.) > v(t,-) (9.65) 
almost everywhere. If y° has bounded total variation, then 
TV(p(t,)) < TV(p?) . (9.66) 


For a proof we refer to Godlewski and Raviart (1991), Chap.II (Sect. 5). Here 
L° is the space of integrable functions with norm 


lolt, -)||[z0 = inf{sup{y(t, x): z € R\A, A is set with zero measure} . 


Equation (9.65) says that the equation is inverse monotone with respect to 
the initial condition. Equation (9.66) says that the solution is total varia- 
tion diminishing or TVD. More precisely, the solution is total variation non- 
increasing, but we prefer the more crisp and commonly used terminology 
TVD. 


Theorem 9.4.2. Monotonicity preservation 
If p? is monotone and is constant outside a bounded domain, then y(t,-) is 
monotone. 


Proof. If suffices to consider the case rz > 21 => y°(x2) > y°(21). It follows 
that 


TV(p") = y? (00) — y°(—00) . 


0 is constant outside a bounded domain, we have 


Because 
p(t, +00) = y? (+00) . 


If p(t, +) is monotone, then 


TV(p(t,-)) = p(t, œ) — p(t, =œ) = TV (4°) . (9.67) 
If p(t,:) is not monotone, its total variation is larger than given by (9.67), 
violating (9.66). o 


This theorem says that TVD implies monotonicity preservation. 


It is desirable that numerical solutions mimick these properties. This means 
that spurious numerical oscillations (‘wiggles’) are to be avoided. Especially 
near discontinuities in the solution these can be a nuisance. 
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The Riemann problem 


The Riemann problem for (9.56), (9.57) is defined by the following special 
initial condition: 


p(z)=y, 2<0; P(r) = pr, 2>0, (9.68) 


with y and », constant. The solution of the Riemann problem plays an 
important role in the construction of many numerical methods. We will derive 
its solution. 


It is easy to see that if y(t,z) is a solution of the Riemann problem, then 
so is p(at,az) for every constant a # 0. So we can look for the solution in 
similarity form: 


plt, z) = v(x/t). 


Substitution in (9.56) gives 


p (EHF eE- 6} =0. (9.69) 


Therefore we can construct a solution y(€), € € R, piece by piece by having 
the first or second factor in (9.69) zero. A constant state is obtained with 


i 


y (£) = 0. Putting the second factor zero gives 


f 


f(s) =€, (9.70) 


which means that y is the inverse of f , and hence exists if f is either 
strictly increasing or strictly decreasing, which is the case if f is either convex 
(f > 0) or concave (f” < 0). If this is not the case, it is hard to solve the 
Riemann problem in general form; the case that f” changes sign will be 
illustrated shortly by an example. Because of an analogy with gasdynamics 
that will become apparent later, solutions of (9.70) are called rarefaction fans 
or erpansion fans. Equation (9.69) assumes differentiability of y. A similarity 
solution with a shock satisfying the jump condition (9.60) is 


Fler) — (Fe) 
Pr — pi l 


plz/t)= pi, z<st; (z/t)=y,, z>st; s= 


Thus, we have obtained three building blocks for obtaining Riemann solu- 
tions: constant states, fans and shocks. These must be pieced together such 
that the initial condition (9.68) and the entropy condition (Def. 9.4.2) are 
satisfied. 


Consider first the case f > 0. There are two possible cases to consider for the 
initial condition (9.68): yp > y and y, < pı. As entropy condition we can 
use (9.64). Hence, if pr > pı there cannot be a shock, and the solution must 
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be pieced together from solutions of (9.69), i.e. constant states and solutions 
of (9.70). Define 


i 


st = f (v1) , Sr = f (pr). 
Then the solution is obviously given by 


plz/t) =p, z<st; ple/t)= pr, £> spt; 


flol) =2/t, sit <a < st, (9.71) 


which can be solved for y because f > 0. In the case of the inviscid Burgers 
equation, f (p) = p, and the solution (9.63) is recovered. In the case p, < pi 
the characteristics intersect (because f” > 0) and we use a shock to solve. 
Using the jump condition (9.60) we find 


y(r/t)=y, r<st; p(t/th=p,, x>st; 
s = {f (pr) — f(y) }/ (er pi). 


This completes the treatment of the case f > 0; the case f” <0 is similar 
and need not be discussed. 


(9.72) 


We see that in the cases f > O and f” < 0 the discontinuity in the Riemann 
solution is separated either by a shock or by a fan, but not by both. However, 
this need not be so for f arbitrary. 


The case that f is neither convex nor concave will be illustrated by the ex- 
ample of the scalar one-dimensional version of the Buckley-Leverett equation 
as given in Sect. 4.2 of LeVeque (1992). The flux function is given by 


2 


p 
Fo) = yale)? 

The physical background is the flow of a mixture of oìl and water through a 
porous medium. Here ¢ represents the saturation of water and lies between 0 
and 1. The graph of f is shown in Fig. 9.10 for a = 1/2. Note that f” changes 
sign at y = 1/2. The solution of the Riemann problem can be established by 
geometric means in the following way. Because of the jump condition (9.60), 
a shock between two states pa and pẹ corresponds to a chord in the f — y 
diagram. This shock is admissible if its corresponding chord is in accordance 
with the entropy condition. If it is not we can go (part of) the way between 
Ya and yy along the graph of f, corresponding to a fan. The procedure will 
become clear by taking an example. Let. y, = 0, yı = 1. We can connect 
these states by an admissible shock between y = 0 and y = y1, where the 
chord coincides with the tangent, cf. Fig. 9.11. Between y = y, and y we 
continue along the graph. This solution satisfies the entropy condition, and is 
the unique solution of the specified Riemann problem. In this particular case 
the solution can be obtained in a more or less explicit analytic form. From 
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Fig. 9.10. Flux function for Buckley-Leverett equation 
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Fig. 9.11. Application of Oleinik’s entropy condition to Buckley-Levertt problem 


(ei) /p1 = f (y1) 


we find yı = 1/3. So to begin with we have shock between y, = 0 and 


— Pr — Pi = /3_]. 
s= aa e T T] 


Between p; and p; = 1 we have a fan satisfying (9.70), i.e. 


p-p? 


pp FO on) 
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This gives implicitly 


p =g(z/t). 

By plotting € as function of p using (9.73) we obtain automatically the plot 

of y versus = z/t. Thus, we have obtained the following Riemann solution 
for the Buckley-Leverett equation: 

y(a/t)=1, 2<0; y(z/t)=g(2/t), 0<2<(V3-1)t; 

y(a/t)=0, 2>(v3—-1)t. 


Figure 9.12 shows the solution for t = 1. Note that we have a shock followed 
by a fan. 


(9.74) 
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Fig. 9.12. Solution of Riemann problem for the Buckley-Leverett equation 


Conservative and nonconservative numerical schemes 


After this discussion of some important analytic aspects of scalar conserva- 
tion laws, we now turn to numerical discretization methods. The occurrence 
of weak (discontinuous) solutions makes a significant difference compared 
to the case of strong (differentiable) solutions. In the latter case, consistency 
and stability imply convergence. But for convergence to weak solutions, these 
conditions do not suffice. Convergence to weak solutions satisfying the jump 
condition (genuine weak solutions) is guaranteed only if the numerical scheme 
is in conservation form (the definition of a conservative scheme is repeated 
below). Furthermore, just as for the differential condition, nonphysical weak 
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solutions are excluded only if the numerical scheme satisfies an entropy con- 
dition. 


The following example shows that a nonconservative scheme may fail to con- 
verge to a genuine weak solution. 


Example 9.4.4. Nonconservative scheme for inviscid Burgers equation 
This example is due to LeVeque (1992), Example 12.1. The nonconservative 
form of the inviscid Burgers equation (9.58) is 


Pt t py, =O. 
The first order upwind scheme is given by, assuming ọ > 0: 
ep = p + AOR (YF — GF) =0, A=T/h. (9.75) 
Suppose 
ep=l, j<0; vf=0, j2>0. 
It is easy to see that the numerical solution is given by 
p=, Vn, (9.76) 


which is obviously wrong. It is left to the reader to show that the exact 
solution is given by 


g(t,z)=1, z<t/2; p(t,z)=0, z>t/2. (9.77) 


We see that with the nonconservative scheme (9.75) the shock moves with 
the wrong speed, namely zero; as show by (9.77), the correct speed is 1/2. 0 


Fortunately, convergence to a weak solution that satisfies the jump condition 
is guaranteed for conservative schemes. A conservative scheme for (9.56) is a 
scheme of the following form: 


1 
dip; /dt + (Fy 4ay2 — Fj-aj2) = 0, 


Py41j2 = F(Qj—ps Pi-p+1: +) Pita) » 


(9.78) 


where the temporal discretization can still be chosen as one likes. Summing 
between j = k and j = m gives 


d © 1 
dt > 9; + p Fm412 = Fk-17/2) =0, 
j=k 


so that $` y; is conserved if the boundary fluxes F412 = Fk-1/2 = 0, which 
is why (9.78) is called a conservative scheme. 
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We will now present a few examples of conservative schemes. Forward Euler 
temporal discretization of (9.78) gives 


rth ph AFP jg — FM yj) =0. (9.79) 


A conservative nonlinear version of the Lax-Friedrichs scheme (9.32) is ob- 


tained with 


n 1 n n 1 n n 
Foe /2 = a (Yi — P+) + zle; + Fia) . (9.80) 


In the linear case, f(y) = cp, and (9.32) is recovered. An obvious conservative 
nonlinear version of the Courant-Isaacson-Rees (or first order upwind) scheme 
(9.33) is obtained by choosing 


Fh p =F) if Fy) 295 


j- j) BL Wi (9.81) 
Faja = Fa) 1E S Fay) <0, 


where gf ij. = (p7 + yf )/2. We will see in Example 9.4.5 that this scheme 
has a serious flaw. A better nonlinear version of the Courant-Isaacson-Rees 
scheme is the Engquist-Osher scheme (Engquist and Osher (1981)). This 
scheme is obtained with 


Pit aye = ft (97) + fT (9541) ’ 


flp) = F(0) + J max{f (%),0}jdy, F(p) = J min{ f (w), 0}dv . 
(9.82) 


That this scheme makes sense can be seen by noting that 
fle) = fte) + J~ (p). Furthermore, in the linear case f(y) = cy the first 
order upwind scheme is recovered. Why (9.82) is a better nonlinear general- 
ization of the first order upwind scheme than (9.81) will be seen shortly. 


There are several conservative versions of the Lax-Wendroff scheme (9.26), 
all of two-stage form. MacCormack’s scheme (MacCormack (1969)) is well- 
known; it is given by 
Pj= 0) + MF (P41) — FeF)} = 0: 
n 1 n * À * * (9.83) 
ppt — 5(F3 + yj) + z Ff (¥}) — f(yj_a)} = 0. 


By elimination of y* it is easily seen that this scheme is conservative. 


Schemes of the form (9.78) are called consistent if 


F(p, P) = f(P) (9.84) 
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and if there exists a constant K such that 


|F (Pj-p: Oj—p41, © Pj+a) ~ (P| < K max{|~j4m — P| : m = =p, na 
9.85 


for Pi-ps, Pj-p+1, -= Pj+q Sufficiently close to ¢. It is left to the reader to 
show that this implies consistency of first order in the classical sense if the 
solution p(t, x) is sufficiently differentiable (cf. Kroner (1997) Lemma 2.2.4). 


The celebrated Lax-Wendroff theorem (Lax and Wendroff (1960)) asserts 
that if the solution of a conservative and consistent scheme converges as 
7T10,h 10, then it converges to a genuine weak solution of the conservation 
law. In other words, we cannot get a shock that moves at a wrong speed. 
This is the merit of conservative schemes. For convergence we need stability, 
of course. An in-depth study of the reason why nonconservative schemes may 
converge to wrong solutions is given by Hou and Le Floch (1994). 


The entropy condition 


In order to ensure that a numerical solution converges to the physically rele- 
vant weak solution (the entropy solution), the numerical scheme must satisfy 
some form of the entropy condition, in addition to being consistent, con- 
servative and stable. LeVeque (1992) (Example 12.3) gives an example of a 
conservative scheme that generates a numerical solution violating the entropy 
condition. We present a slight variation. 


Example 9.4.5. Entropy condition violating scheme for inviscid Burgers equa- 
tion 
Consider the inviscid Burgers equation with initial condition 


y'(z)=-1, 2<0; pYz)=1, z>0, 
with the following corresponding initial data for the numerical solution: 
ep=—-l, j<0; gf=l, j>0. 


Consider the nonlinear version of the first order upwind scheme defined by 
(9.79) and (9.81). Since F? aip = Fija YJ, we obtain pi = p3, and hence 


pi =pj Wn. 


However, this solution does not satisfy the entropy condition. The corre- 
sponding characteristics are given by 


e/t=-1, z<0, e/t=1, z>1, 


and emanate from the shock at z = 0 instead of converging into it. The 
entropy solution is given by 
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plt, z) =—1, <-t; p(t,z)=2/t, -t<2x<t; 
g(t,z)=1, z>t. 


We will show later that the Engquist-Osher scheme (9.79), (9.82) converges 
to the entropy solution. m 


We will not formulate entropy conditions for numerical schemes. For such 
a condition see LeVeque (1992) Sect. 13.4. It turns out that conservative 
monotone schemes converge to entropy solutions. Monotonicity is not difficult 
to check. 


Monotone conservative schemes 


As shown by Theorem 9.4.1, entropy solutions of scalar conservation laws 
have the following monotonicity property: if two initial conditions y°(r) and 
(x) satisfy y? > p°, then y(t,-) > y(t,-). It is nice if numerical schemes 
mimick this property. We therefore define 


Definition 9.4.3. Monotone conservative schemes 
A conservative scheme 


orti oh AE” aja — FP yj) =0 (9.86) 


is called monotone if for any two numerical solutions p} and yẹ we have the 
following property: 


pr > we, Y= pit > yt, vj. 


By rewriting (9.86) in the following form 
git! = H,(p") (9.87) 


we see that the property that H is a nondecreasing function of its arguments 
is equivalent with the scheme being monotone. We have 


Theorem 9.4.3. Convergence of monotone schemes to entropy solution 
The solution of a monotone conservative scheme converges to the entropy 
solution as T | 0 with r/h fired. 


A proof has been given by Harten, Hyman, and Lax (1976) and and Crandall 
and Majda (1980). 


Ezample 9.4.6. Nonmonotone upwind scheme 

Because of the preceding theorem we expect that the upwind scheme of Ex- 
ample 9.4.5 is nonmonotone. That this is indeed the case may be shown as 
follows. Let 
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ygy=—a<0, j<0; pj=b>0, j>0. 
Writing scheme (9.79), (9.81) in the form (9.87) we obtain for b < a: 


l 


1 
H(y) = po — Jit 


1 
7AP = —a + Ala” —b*)>-a. 


Increasing b such that b > a gives 


1 1 
Holp) = po — FApo + 5 Po =—a, 


so that Ho(y) has decreased, whereas one of its arguments has increased. O 


Since, as is easy to see, the first order upwind scheme is monotone in the linear 
case, it comes as a surprise that the nonlinear version (9.81) is nonmonotone. 
The following example shows that a monotone nonlinear generalization of the 
first order upwind scheme is provided by the Engquist-Osher scheme. 


Ezample 9.4.7. Monotonicity of Engquist-Osher scheme 
We will show that the Engquist-Osher scheme (9.79), (9.82) is monotone. 
The scheme can be written in the form (9.87) with 


Hy (yp) = AFT (e512) + By — AFT (3) +AT (3) — AFT (Yj 41) - 
Hence 
9H;/O9;-1 = Amax{f (y;-1),0}>0, 
OH; /Op;41 = —Amin{f (y;41),0} >0, 
0H;/dg; = 1-AXlf (y;)|. 


We see that the Engquist-Osher scheme is monotone, provide the (nonlinear 
version of the) CFL number A|f | satisfies 


MF (p)l<1, Vi, (9.88) 
which can be taken care of by taking the time step r small enough. m 


It is left to the reader to show that the Lax-Friedrichs scheme (9.79), (9.80) 
is monotone, provided (9.88) is satisfied. 


Just as in the linear case, monotone schemes are subject to Godunov’s order 
barrier theorem: their order of consistency in regions where the solution is 
smooth is at most one. This is shown in Harten, Hyman, and Lax (1976). 
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The Godunov scheme 
The Godunov scheme is of the following form: 

pet -7 H + MPF 41/2 -7 Fi 172) =0, F;41/2 = F(p}, pj+1) ' 
Godunov (1959) proposes the following choice for F: 


tn41 


FS. == | Het zisya)dt. (9.89) 
tn 

Here y* is the solution of the Riemann problem with left constant state p7 
and right constant state p741. This gives the exact result for 7 smal! enough 
such that waves coming from adjacent cell faces do not interact, and if we 
define p(tn, £) to be piecewise constant, with y(t,,r) = y? for zj — h/2 < 
£ < z; +h/2. Since y* is a similarity solution: y*(t, £) = p* ((£ — 2j41/2)/t), 
we have p" (t, 2;41/2) = ¢” (0) = constant, so that (9.89) becomes 


F (oF, e341) = f(e" (0) - 


The wave speed is bounded by | Fl, so that there will be no interaction 
between adjacent Riemann problem if 


Af (pe) <1, j= 0,41, 42,... 


This is in fact a restriction on the CFL number, similar to the stability 
restrictions that we found for the linear case in Sect. 9.3. 


Assume f > 0. Recalling the solution of the Riemann problem obtained 
earlier in this section, if y?,, < p} then we have a shock propagating with 
velocity 


s = {fløy — Pep) (p41 — 93) | (9.90) 


so that in (9.89) 
PO = p7p’ 8<0; (=H; , s>0. (9.91) 


On the other hand, if y?,, > p7 then we have a fan satisfying (9.70). In our 
case, E = (£ — 2;41/2)/t, £ = £j41/2, 50 that € = 0, and ¢*(0) follows from 
from the fan solution (9.71) with z = 0: 


e=, f(y) >0; (9.92) 
g“ (0) = p741 ACAR <0; (9.93) 


i 


F (p"(0))=0, Fo) <O0< Ff l) (9.94) 
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This completes the specification of the Godunov scheme. 


Because f > 0, f(y?) > 0 implies s > 0, and f'(y%,,) < 0 implies 
s < 0, with s given by (9.90). Hence, if we were to use instead of an exact 
Riemann solver (as Godunov) an approximate Riemann solver which violates 
the entropy condition and which uses an expansion shock rather than an 
expansion fan, so that we always use (9.90) and (9.91), then only in grid 
points where (9.94) occurs an error is made. Because of an analogy with 
gasdynamics, the case of (9.94) is called a transonic rarefaction. 


In LeVeque (1992) Sect. 13.2 the following equivalent expression is given for 
the Godunov flux: 


F(p}, P341) = min{f (p) : p <es geri} T < Pra] ; 


a n n a n (9.95) 
F(p}, p341) = max{f(¢) : Pit SPS PF} 695 > Pir. 


It turns out that this gives the correct flux for al scalar conservation laws, 
even nonconvex ones, for the Riemann solution that satisfies the entropy con- 
dition. 


It has been shown by Crandall and Majda (1980) that the Godunov scheme 
is monotone. Hence, it satisfies the entropy condition, according to Theo- 
rem 9.4.3. A different proof that the Godunov scheme satisfies the entropy 
condition has been given by Harten and Lax (1981). 


Higher order schemes 


As is easily seen in the linear case, the Godunov scheme is only first order 
accurate, Lax-Wendroff type schemes, such as MacCormack’s scheme (9.83), 
are second order accurate, but do not preserve monotonicity, and show spu- 
rious wiggles near discontinuities. For efficient computation of practical flow 
problems, second or higher order accuracy is desirable in smooth regions of 
the flow, with crisp resolution of discontinuities, while avoiding spurious wig- 
gles near discontinuities. A way to achieve this is by extending the technique 
of flux limiting discussed in Sect. 4.8 to the nonlinear case. 


First of all, schemes that are second order accurate in space in smooth parts 
of the flow have to be developed, generalizing the x-scheme of Sect. 4.8 to the 
nonlinear case. A way to do this has been proposed by van Leer (1979), and 
is generally known as the MUSCL (monotone upwind schemes for conserva- 
tion laws) approach. The idea is not to use the piecewise constant states Pj 
and ;41 as input for the (approximate) Riemann solver F(p}, p41), but to 
extrapolate with a directional bias from the left and right to obtain right and 
left states Ply j2 and Y5 41/2» Which are used as input for the (approximate) 


Rieman solver Fpi PF 41/2): This is to be done in such a way that in the 
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linear case (f(y) = uy) the numerical flux of the x-scheme (9.30) is obtained. 
We therefore write 


_ 1 l-K 
541/2 — 5 (3 + pj4i) + (ein + 29; — p41) 5 


1 l-k 
Pta = 9 (Fi + y141) + a (9 + 20541 - 542) . 


(9.96) 


In the numerical flux function F at x;41/2 the left and right states p; and 
yj+1 are simply replaced by Pti and P5442) respectively. This gives: 


dy; l ~ — 
P + p E Oa Pj41/2) T Flej Pj-1/2)) =0, (9.97) 


with F a consistent numerical flux function, i.e. F(p, ¢) = f(y). This spatial 
discretization can be shown to be second order accurate (cf. Kroner (1997) 
Sect. 2.5). It is left to the reader to show that in the linear case (f(y) = cy) 
the scheme (9.97) reduces to the «-scheme (9.30). 


Van Leer obtained (9.96) by replacing the piecewise constant per cell approx- 
imation of y by a piecewise linear distribution. In order to avoid spurious 
wiggles, (9.96) is not used, but the slopes of the piecewise linear distribution 
are limited. Therefore this class of schemes, to be described next, may be 
called slope limited schemes. 


Another way to obtain wiggle-free schemes is to put a limiter on the numer- 
ical flux function, resulting in fur limited schemes. This approach has been 
pioneered by Harten (1983), and put in a general framework by Sweby (1984). 
It turns out that the principles to ensure the absence of spurious wiggles are 
about the same as for slope limited schemes, and similar limiter functions 
are used. Numerical experiments by Anderson, Thomas, and van Leer (1986) 
show somewhat better accuracy for slope limited schemes as compared with 
flux limited schemes. We will not discuss flux limited schemes. 


Still another way to realize second order schemes without spurious wiggles is 
to add explicitly a judicious amount of artificial viscosity to a second order 
central scheme. This results in artificial viscosity schemes, to be discussed 
later. 


Slope limited schemes 


In order to rule out spurious wiggles, the k-scheme has to be modified. One 


way to do this follows easily from the linear case discussed in Sect. 4.8. As in 
(4.102) we replace (9.96) by 
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Pt i2 = Pj + preset pi)» 


1 Pj Z Pi-1 
(e-em), r= ABE 
pp TP Pj+i— Pj 


(9.98) 





9541/2 = Pj+it T 


We will now derive conditions on the limiter y(r) for the resulting scheme to 
be monotonicity preserving. This is an easy generalization of the conditions 
obtained for the stationary linear case, provided the numerical flux function 
satisfies the following conditions: 


Flo yaa Praa) = E (jaye) tE O ay2) | 


- dFt (9.99) 
dF” (p) 0, 520, WER. 
p 


dp 7 
The resulting semi-discetized scheme is given by 
dy; _ 1/2 
= te Lir- (7) + FHES. (9.100) 
In the case of explicit Euler discretization we get 
pitt — oh + ME (07) + FF (pt) BAe = 0 (9.101) 


The basic idea here is to extrapolate a left state o7 41/2 and a right state 
P3412 from neighboring grid points, to put limitations on PË: /2 in order to 
suppress spurious wiggles, and to base the numerical flux F;41/2 on (approx- 
imate) solution of the Riemann problemRiemann!approximate solver corre- 
sponding to the left and right states Pry Iz 


Total variation decreasing schemes 


In view of Theorem 9.4.3 one would ideally like to have monotone schemes. 
However, this seems to be asking too much for second order schemes. Instead, 
we will be content with a weaker requirement that excludes spurious wiggles. 
Harten (1983) shows (in the one-dimensional scalar case) that a TVD scheme 
is monotonicity preserving. The discrete TVD property has been defined after 
equation (9.19). Hence, TVD schemes share the properties put forward in 
Theorems 9.4.1 and 9.4.2 with the exact solution. For a TVD scheme the total 
variation is obviously bounded (TVB), and TVB schemes are convergent (see 
LeVeque (1992) Sect. 15.3, for example). Hence, TVD schemes are stable. 


We will give conditions for explicit schemes to be TVD. Let the scheme be 
denoted by 


gett = yt + 05 (P41 ~ 97) +b; (GF-1 — YF) (9.102) 


where a; and b; may depend on PF Pp 41s Pitz We have the following 
theorem. 
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Theorem 9.4.4. (Harten (1983)). 
If the coefficients in scheme (9.102) satisfy for all j 


aj>0, 5, >0, a; + bj41 <1, (9.103) 
then the scheme is TVD. 
Proof. It follows from (9.102) that 
gett — path = bilo] — pfs) + (1 bjt — ay) (OF 4a — 99) (9.104) 
+a541(9F 42 — V+) - 
Because of (9.103) the coefficients in (9.104) are nonnegative, so that 
ett — eet < by le? — ej-l t (CL biyi — as lefar ~ 27 
+a; 4119542 - grad . 


Summation over j gives 
TV(y"t4) = Vleet -els 
j 


Sbi + 1 bjp — a3 +alle 1 — oF] = TV"). 


j 
Oo 


We now give conditions for the slope limited scheme (9.101) to be TVD. The 
scheme can be written in the form (9.102) with 


F- (G5 41y2) — FO (5-1/2) Pi 41y2 ~ Pj-1/2 


aj = -À — = z z 
Pit+1/2 T Pj-1/2 Pj+1 P; 9.105 
Ft(pt, sj.) Flota) Shaye ot (9.105) 
b. = Pj+1/2 Pj-1/2) Pj+1/2 — Fj-1/2 
J + + n n i 
Pji+1/2 7 Pj-1/2 Pj — Pj-1 


Since according to (9.99) we have 


dF (¢) dFt 
ra — 
dy <0, dip >0, Ve eR, 


the first fraction in a; is nonpositive and the first fraction in b; is nonnegative. 
In order to have a; > 0, b > 0 according to (9.103), we must therefore have 


+ + 
Pi+1/2  Pj-1/2 
p7 = pj- 1 


2541/2 — Pj-1/2 


> 0 
O41 — PF 


>0. (9.106) 


This is equivalent to (4.109), and conditions on the limiter %(r) to satisfy 
(9.106) have already been studied in Sect. 4.8. The graph of y(r) must lie in 
the admissible region of Fig. 4.11 It remains to satisfy the last part of (9.103). 
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Assume 


+ + 
Piiz T Pj~1/2 


p7 - Pj 1 


Pj4+1/2 7 Pj-1/2 


<M 
vara = pi 7 


<M 


(this is the case if the graph of y(r) satisfies the condition just stated). Define 


FH et 510) — Ft (tay) Elaa) — FO (45-12) 
eyn = j+3/2 GHif2? | j+1/2 j—1/2 <e, Vj. 


+ _ - 
Piya. T P2 Pi+1/2 — Pj-1/2 
Then the last part of (9.103) is satisfied if the time step is chosen small 
enough, i.e. 


a< 


—. .107 
<= (9.107) 


This is similar to a stability condition on the CFL number, since it turns out 
that for practical schemes c;41/2 is related to the wave propagation velocity 


f (p). For example, for the Engquist-Osher scheme we have 


+ 
P 5343/2 
1 i 
C412 = So? max{f (4%), 0}dy 
Pj+3/2 T Pj+1/2 y 
P 41/2 
P 41/2 
1 . ' 
-— $ J min{ f (b), O}dy . 
Pi+iy2 T Pj-1/2 _ 


P172 


Hence 


0 < cj41/2 < ¢ = max{|f (y)|}. 


It now becomes clear why it is reasonable to choose M = 3, leading to the 
admissible region for the graph of y(r) given in Fig. 4.11. In Sect. 4.8 it has 
been shown that for second order accuracy we must have 7(1) = 1, leading 
to M > 2. In order to give limiters some leeway we choose M > 2; in order 
to make (9.107) not overly strict we do not want to choose M large, leaving 
us with M = 3 as a reasonable compromise. 


For first order flux splitting schemes that do not use extrapolation and lim- 
iting the analysis is easier. For the Enquist-Osher scheme we have already 
obtained condition (9.88) for the scheme to be monotone, hence TVD. 


As we remarked before, TVD schemes are stable. Furthermore, scheme 
(9.101) is in conservation form, so that genuine weak solutions satisfying 
the jump condition will be obtained. In order to demonstrate satisfaction of 
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the entropy condition it would suffice that the scheme is monotone, but this 
is not the case. But that correct solutions will be obtained can be made 
plausible as follows. The main defect of schemes that do not satisfy the 
entropy condition is that they allow an expansion shock where the exact 
solution has an expansion fan, and is smooth and monotone. In the case 
f >0, f” > 0 (for example) the exact solution would be smoothly increas- 
ing. Now suppose the numerical solution shows locally an expansion shock, 
with 0 < yj_-1 < pj Yj 41- This brings r; close to zero, hence (rj) will be 
close to zero (cf. Fig. 4.12), and the slope limiter switches the scheme to the 
underlying first order scheme. But this scheme does not allow an expansion 
shock, if it satisfies the entropy condition. Hence, we may be confident that 
slope limited second order schemes will not develop expansion shocks, of the 
underlying first order scheme satisfies the entropy condition. It suffices that 
the first order scheme is monotone, which is easy to check. 


Other temporal discretizations 


It is frequentely desirable to replace the O(r) temporal discretization (9.101) 
by an O(r*) scheme for better accuracy, or by an implicit scheme, in order to 
relax the time step restriction (9.107). We start with the following implicit 
scheme: 

pet + elejti — 977") + dilet- 977") = (9.108) 
PF talpi — 95) + bs (¥F-1 — 97) » 


where a; and p; may depend on 9? , y741, P74, + and cj and d; may depend 


on prt? pit i priz . The following theorem gives conditions for scheme 


(9.108) to be TVD. 


Theorem 9.4.5. (Harten (1984)). 
If the coefficients in scheme (9.108) satisfy for all j 


a;>0, b >0, ajj+b;<1, -wK<C<e,dj; <0 
then the scheme is TVD. 
Proof. Let us denote the scheme (9.108) as 
Ly"t! = Ry”. 
First we show that TV(y) < TV(Ly). Writing y = Ly, we have 


viti — w = d(H; — pj-1) + (1 -cj — dj) (Gj41 — P) 
+ ej4i(Pj42 — Yj41) 5 

Vii — Hj — djlpj — Bj-1) — C741 (Hi42 — YI41) = 

(1 ~ c — dj41) (pina — pj). 
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Taking absolute values, taking the signs of the coefficients into account and 
using the triangle inequality we obtain 


leita — Vy] — dile; — e3-1l — Cj4alyy+2 — Ys 411 > 
(1 — ej — dj41)|j+1 ~ psl. 
Summation over j gives 
TV(h) — $ (dii + e)l- pl > TVo) — So ldi t es) ej41 — yl. 
Hence TV(w) > TV(v), or 
TV(y) < TV(Ly) . 
From Theorem 9.4.4 it follows that 
TV(Ry) < TV(¢) . 
Combination of the last two equations gives the desired result: 
TV(p"t*) < TV(Ly"*") = TV(Re") < TV(p") . 
O 


Let us apply this theorem to the temporal discretization of (9.100) with the 
w-method: 


perth + wR (pot) +wF (ptt) 4 


(1- w) F(p”) + (1—w) Ft (pt) PAN? = 0. 


with F+ satisfying (9.99). This results in 
pitt — walt ont — ght) — wbt (gt — grt) = 
pj + (1 — wal (prs, =- pF) +(1- w)b? (pnj-1 7 p7) ’ 


where a}, b?, ant and bets are given by (9.105) by putting in the appropriate 
time level superscripts. According to the preceding theorem, this scheme is 
TVD if, assuming 0 <w < 1, 


art? >0, bFtt>0, aP>0, bP >0, (9.109) 


and 
(1 —w)(ae + bY) <1. (9.110) 


The explicit case w = 0 has already been studied. Conditions (9.109) also 
have to be satisfied in the explicit case, leading to (9.106), which results in 
the usual restrictions on the limiter y(r). But for w > 0 (9.110) is weaker 
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than the last condition of (9.103), and it is easy to see that (9.107) is replaced 
by the weaker time step restriction 


1 

AS Ta w)eM * 

The price to pay is of course the need to solve a nonlinear system per time 
step. If we linearize a”+! and b"+! in some fashion, the system to be solved 
becomes linear, but is no longer conservative. This makes it less suitable for 
time-accurate computation of time-varying flows, but for time-stepping to 
steady state with large time-steps (taking w close to 1) a linearized scheme 
is fine, since the steady-state version of the scheme is conservative. Further- 
more, the matrix of the system to be solved is obviously diagonally dominant, 
provided the inequalities (9.109) are maintained after linearization. 


The preceding implicit scheme is O(7?) accurate for w = 1/2. Higher order 
accuracy may also be obtained with explicit schemes. For brevity we rewrite 
the semi-discretized scheme (9.100) as 


dp 1 
Ry); = {Fo (97) + FHH 


where ọ is the algebraic vector with elements y;. Let us consider Runge- 
Kutta methods for temporal discretization. A two-stage Runge-Kutta method 
(known as the method of Heun) is given by: 


po =e" +AR(¢"), 


9.111 
prt =p" + SMRC") + RC") me 


We will show that this scheme is TVD (hence stable) under certain conditions. 
The scheme can be rewritten as a succession of two explicit Euler steps: 


g = vy" +XR(p"), 
gy = p +AR(¢"), 


li *% 
pt = ale" +e"). 


Conditions for the explicit Euler scheme to be TVD have already been given. 
If these conditions are satisfied, then 


TV(g"") < TV(g"), TV(e") < TV(¢") . 


Since 


n 1 n 1 +*+ 
TV(p"*") < ZTV") + ZTV") 


m 
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the Runge-Kutta scheme (9.111) is TVD under the same conditions as the 
explicit Euler scheme (9.101). If the spatial accuracy is O(h?), the temporal 
accuracy need not be better than O(r?), so (9.111) is an attractive time- 
accurate scheme. 


If the spatial accuracy is third order, for example when a limited x = 1/3 
scheme is used, then it may be attractive to have third order accuracy in 
time as well. The following Runge-Kutta method 


p =e" +rAR(p"), 


oe 3 l, Àp * 
p= ge tge taR), (9.112) 
1 2 2A 
n+l ` on A kk an xk 
gT = zp" tae +o Re), 


is shown in Shu and Osher (1988) to be third order accurate in time and TVD 
under the same condition under which the explicit Euler scheme is TVD, i.e. 
(9.107). A general study of TVD Runge-Kutta schemes is presented in Shu 
(1988), and the subject is also considered in Hundsdorfer, Koren, van Loon, 
and Verwer (1995). 


This completes our general discussion of higher order TVD schemes for scalar 
conservation laws. Specific schemes may now be formulated by combining an 
approximate Riemann solver (i.e. a numerical flux function) with a limiter 
function, corresponding with some value of x; a number of limiter functions 
are discussed in Sect. 4.8. But we will discuss particular schemes and numer- 
ical results not for the scalar case, but for the Euler equations, in Chap. 10. 


Artificial viscosity schemes 


The TVD schemes discussed before belong to the class of approximate Rie- 
mann solvers: the numerical flux F'.1/2 18 related to the approximate so- 
lution of the Riemann problem based on two constant states p7 and y? 1, 
or Pry j2 and yi) j2 in the case of second order slope limited schemes, In 
essence, these are generalizations of upwind biased schemes to the nonlinear 
case. To combine second order accuracy and the TVD property, one may also 
try to modify central schemes, such as the Lax-Wendroff scheme, by adding 
a solution-dependent amount of artificial viscosity. A widely used scheme of 
this type has been developed in Jameson, Schmidt, and Turkel (1981), Jame- 
son (1985b), Jameson (1985a), Jameson (1988), and is generally known as 
the Jameson-Schmidt-Turkel scheme or JST scheme for short. The artificial 
viscosity has to be judiciously chosen, and has evolved over the years. Jame- 
son (1985a) proposes the following scheme. The conservation law (9.56) is 
semi-discretized as follows: 


9.4. Scalar conservation laws 389 


dy; 1l j+1/2 
a tp) + Bhaja 0. (9.113) 


Here F'(p) is a central approximation of the flux: 


F (saya) = SUF (es) + lesa) - 


The term d represents artificial dissipation, needed to suppress spurious wig- 
gles. The term d(y) must be carefully chosen to ensure crisp resolution of 
discontinuities. Jameson, Schmidt, and Turkel (1981) define d as follows: 


2 
dj4ij2= ae) Riss/2(eia1 — pj) + 
4 
e$ P1/2Rj+1/2(2342 — 3pj+1 + 39; — pj-1}- 


Here e?) and ¢4) are solution-dependent coefficients, and Rj41/2 18 a factor 
to give d;41/2 the proper dimension and scale: 


1 i i 2 ` 1 — 

Rjsiya = FU (vi) tS (pj+1)}, i= min{ 3, kD; 41/2} , 
41/2 = max{0,k4 —adj41/2}, Dj41/2 = max{vj42, Vjt Vj, Vj-1} 5 

vy = 2m pi +P) paa, DL, 
Pi+ı + 2pj + Pj- 32 
The quantity v; is meant to be a sensor of shocks. When it is small, ¢'?) is 
small, and e{4) = k(4), giving only an O(h°) dissipative flux d. Near a shock 

a (2) 

vj œ l and E; 41/2 
order accurate. 


œ% 1, and the artificial viscosity makes the scheme first 


Although the preceding spatial discretization appears to work well in practice, 
it does not seem to be TVD. Jameson (1988) presents an artificial viscosity 
scheme that is TVD, based on the following train of thought. 


Let the artificial dissipation be given by 
dj41/2 = €543/2 — 2€;41/2 + €j-1/2 5 
where 
€j41/2 = Qj41/2(Pj+41 Pi); Qjp >O. 


Noting that 


1/2 1 ; 1 1 1 
FG = Fle)" + af lei = gouel- 5¢-1pel}a 


with 
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fleit) — F(y;) ! 
C5 41/2 = L og EP Ga =f le), PEP, 
Pj+1 — Pj 


we can rewrite the scheme (9.113) as follows: 


dp; 1 . 1 j4+2 
h- =— zee + 3 j-1/22l4-1 ~ aj+a/20/5" + 
ad . ;— 
3aj41/2~2" — 8a;-1/2¢l}_4 + aj-3/29|}-2 ` 


It can be shown that for {y;(t)} to be TVD it is necessary that a,;43/2 < 0, 
which is not the case. Therefore a limiter y(r) is introduced in the artificial 
dissipation, redefining dj41/2 as 


1 €j-1/2 
dj41/2 = Y(ri+1)ej+3/2 — 2e; 41/2 + y{—)ej-1/2 , T= “j-1/2 . 
Tj €j+i/2 


Noting that €743/2 = €j41/2/"j+1 and €;-3/2 = Tj—16j-1/2 the scheme be- 
comes 


, 1 
Z = —iejpph t + 50)-1/2Plj—-1 t+ O541/2{2 + P(r; )— 
1 ; 1 1 
— y(r) jekt — aj-172{2 + Y>) — rj- eha - 
Ti4i Pj Pj-1 
(9.114) 

It is easily shown that {y,(t)} is TVD if the coefficient of pyr is nonnegative 
and the coefficient of pl is nonpositive. This gives 


1 1 
aj41/2{2 + V(r) — maa erat) ~ 565 41/2 > 0, 
3 


1 1 i 
aj-ip{2 +Y) — rj-1¥(——)} -~ 9¢j-1/2 2 0. 
J 37 


Choosing 
1 
Q;41/2 2 5 |¢i+1/2I , 

we must have 

i 

2+4(r)—--y({s)>0, YrsER. 

5 

This satisfied if 
Wr) > M-1>-2, Wr) <(1+M)r. (9.115) 


Hence, it is sufficient if the graph of y(r) lies in the admissible region of 
Fig. 4.12. Jameson (1988) proposes: 
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pir) =0, r<0; Pr)=r, O>r>1; V(r)=l1, r>1. 


Note that this makes the semi-discretized system (9.113) TVD, but the TVD 
property can still be destroyed by the temporal discretization. Further de- 
velopments of TVD versions of the JST scheme are described in Jameson 
(1995a), Jameson (1995b), Kim and Jameson (1995). 


Jameson, Schmidt, and Turkel (1981), Jameson (1985b), Jameson (1985a), 
Jameson (1988) consider only time-stepping schemes designed to reach steady 
state quickly, without regard for temporal accuracy. The semi-discretized 
scheme (9.113) can be denoted as 


d 

“F+R(y)=0, 

where y(t) is an algebraic vector containing the unknowns gj. Runge-Kutta 
schemes are selected for temporal discretization. The following class of m- 


stage Runge-Kutta schemes is considered, to step from t = nt tot = (n+1)r: 


= ¢", 


eeeeneente  seeteseeeannnen 


6 
3 
| 

= 
H 

6 
2 
| 

D 

3 

{= 

~ 

s 
3 
2 


(9.116) 
yl) — vl) — Rim) | 


where 
k k 
Re = Ser R(t) So Yer —1., 
=O r=0 


The coefficients 3, are chosen not to achieve temporal accuracy, but to tailor 
the stability domain such that a large time step 7 can be taken. Because the 
symbol of the linearized differential equation is purely imaginary (it is left 
to the reader to check this), 3, is chosen to maximize the stability interval 
along the imaginary axis (Kinmark (1984)). Because of the artificial dissi- 
pation, the symbol of the (linearized) scheme is not purely imaginary, and 
stability cannot be completely guaranteed, the more so because the scheme 
is nonlinear. When we have reached steady state, we have solved 


R(y) =0, 
which can be written as (cf. (9.114)) 
a; (pj41 — 93) blej- 97) =0, jS, dJ. (9.117) 


Boundary conditions are left unspecified. The following theorem shows why 
the limiter introduced in (9.114) is useful. 
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Theorem 9.4.6. If 
a; > 0, b; > 0 (9.118) 
then the solution of (9.117) is monotone. 


Proof. We can write 


bj 


aj 
pntg aj +b iT 1; jol,. nJ — 1. 


=o 


Since 0 < —2— < 1 we have 


a, ET da i 
min{yj—1, pj4i} < pj < max{ypj-1, pj41} 


therefore p; cannot be a local extremum. This applies for all j, so that {y;} 
is monotone. z 


Hence, if (9.118) is satisfied then spurious wiggles are excluded. Conditions 
(9.118) are satisfied if the limiter y(r} in (9.114) satisfies conditions (9.115). 


Further specifications of the time-stepping scheme are given in Jameson 
(1985b). For efficiency, the dissipative part of the flux is treated different 
from the convective part, defining R'*) as follows: 


R(p); = Ql); + Dle); , 


1 -117/2 lo 
Ql) = al D= —zdjjt i 


RE) = Limot 4 kr D (p ")}, 


dover = 1, i=l. 
r=0 


In Jameson (1985b) it is made plausible that we have stability in the case of 
four stages (m = 4) with 


1 4 5 


= dup = 1 = — = — = —., 
Ykk = kk , Pa z? Bo 5° fer} 9 


These coefficients 8 maximize the stability interval along the imaginary axis, 
as shown by Sonneveld and van Leer (1985). For efficiency, the dissipative 
term is computed only once per time-step, leading to 


R®) = Q(p™)) + Dip) . 


9.4. Scalar conservation laws 393 


A plausible stability condition is found to be that the CFL number satisfies 
T,,f n 
o = max{TIf (PP) < 26. 


In order to increase 7 further, so that fewer time-steps are needed to reach 
steady state, Jameson and Baker (1984) introduce residual averaging. That 
is. at each stage the residual R(*) is replaced by R(*), defined by 


-ERP (9);_1 + (1+ 2e)R™ (p); — eR (y)j41 = RO p). (9-119) 


The scheme can be made stable by choosing 


e> (2-0), 


where o* is the CFL number limit for the scheme without residual averaging. 


Solving (9.119) is not expensive. After convergence, R = 0, hence R = 0, so 
a solution of the original scheme is obtained. 


For time-accurate solutions, the time-stepping scheme needs to be TVD. This 
has the additional advantage that stability is guaranteed. Examples of TVD 
Runge-Kutta methods have already been given in (9.111) and (9.112). TVD 
multistage schemes have been studied by Shu (1988) in general, and for the 
JST scheme by Jameson (1995a), Jameson (1995b), Kim and Jameson (1995). 
We will not go into this further. 


More dimensions 


The multi-dimensional case offers no essentially new aspects compared to the 
one-dimensional case. The scalar conservation law is now given by 


ve OF" (y) 
= t ~ = Í... 
HÈ fg >0, £ ER, Q l, ,d, 


where d is the number of space dimensions. The summation convention does 
not apply in the remainder of this section. The total variation TV(y) is 
defined as follows, in the two-dimensional case (for brevity): 


TV (y(t, -)) )) = lim sup = | fi lolz! +e, 27) — p(z',2*)|dx'dz? . 


tims? f f lp(r!, £? +e) — v(x, x?) |delde? . 
ceļ E 
The extension to three dimensions is obvious. It can be shown (Crandall 


and Majda (1980)) that » is TVD. The total variation of a grid function 
Pj; J = (Ji, Ja) on a d-dimensional grid is defined analogously: 
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=). pa lPj+2ea — ¥5 
ji 


where e1 = (5, 0,...,0), €2 = (0, T 0, ...,0} etc. Just as in the one-dimensional 
case, experience shows that we get satisfactory results if the numerical scheme 
is TVD, and again the TVD property implies stability. We will therefore an- 


alyze conditions under which numerical schemes are TVD. 


We will study slope limited schemes, and generalize (9.100) to more dimen- 
sions as follows: 


A D p (Fale) FOON =O, 


Jala 


where Pr te, are extrapolated values obtained from ¢ in neigboring grid 
points in the x%-direction in exactly the same way as in the one-dimensional 
case. In the case of explicit Euler discretization we obtain 


grt — oh tS {Fr (97) + Ft (et) ites =0, Ae =T/ha- (9.120) 


We will now give conditions for this scheme to be TVD, in a similar way as 
in the one-dimensional case. To this end we rewrite the scheme as follows (cf. 
(9.102)): 


pert = oP +S {at (Pipen — PF) HDS ea — PF} - (9.121) 
In the same way as Theorem 9.4.4 it can be proved that this scheme is TVD 
if 


af >0, BF >0, X (af +b% e) <1. (9.122) 


& 


Comparison of (9.120) and (9.121) shows that we can write 


Fa (Pjte,) — Fa (Pj-ea)  Piten —Pj-ea 


a“ = —À 


3 oO — — ki 
Pipea T Pi-eg Piten p7 
+ (int + + + 
p% d Fy (Pipen) ~ FE (P7 en) Pipeau = Pi-ea 
je + + nm _ jan ’ 
Pitea T Pj-ea Pj T Pj-2ea 


We assume that the numerical fluxes satisfy (cf. (9.99)): 





dF (¢) dF (p) 
dip <0 dp 





>0, VER. 


In order to have a¥ > 0, b7 > 0 we must therefore have 
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Pites 7 Pie, 


+ ot 
Pipex Pi eax > 0 
Piteo — a 7 


> 0 
pF ~~ Pi 2e, 


This will be satisfied if we use flux limiting with a limiter y(r) whose graph 
is inside the admissible region of Fig. 4.11. It remains to satisfy the last part 
of (9.122). Define 


_ Fi(¢tise,) — Ft efi.) Fa ize.) — FT (Yj-c.) 


am 
Here, = + + = Z 
Pjt3ea ~~ Pitea Pites ~ Pjm~ea 


Assume 


- - + 
Pitea 7 Pj-ea Piten 7 Pi-ea 
Pitea Pinta <M, Sitea fiee <M, 
Pj42eq -Pj j 


n 
Pij Pij-~eq 


which is the case if the limiter satisfies the usual conditions. Then the last 
part of (9.122) is satifsied if 


So A Hite, <1/M. (9.123) 


As in the one-dimensional case we usually have 





for some y, so that (9.123) can be interpreted as a condition on the CFL 
numbers in the various coordinate directions, which can be satisfied by tak- 
ing the time step 7 small enough. 


It has been shown that, unlike the one-dimensional case, in two dimensions 
TVD schemes cannot be formally second order accurate (Goodman and Lev- 
eque (1985)), and presumably this holds also in the three-dimensional case. 
Nevertheless, schemes of the type considered above using a one-dimensional 
slope limited flux splitting method in each coordinate direction usually give 
second order accurate results in smooth parts of the flow and give non- 
oscillatory and sharp solutions near discontinuities. 


Exercise 9.4.1. Show that if a weak solution is in C}, then it is a classical 
solution. 


Exercise 9.4.2. Prove (9.61). 
Exercise 9.4.3. Show that (9.62) satisfies the jump condition. 


Exercise 9.4.4. Solve the Riemann problem for the Buckley-Leverett equa- 
tion with pr = 1, yı = 0. 
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Exercise 9.4.5. Prove (9.77). 


Exercise 9.4.6. Show that the local truncation error for (9.78) tends to zero 
with the mesh size, if F;41/2 satisfies the consistency conditions (9.84) and 
(9.85), assuming the solution y(t, x) to be differentiable (cf. Kröner (1997) 
Lemma 2.2.4). 


Exercise 9.4.7. Show that under suitable restrictions on A (which?) the 
Lax-Friedrichs scheme (9.79), (9.80) is monotone. 


Exercise 9.4.8. Show that in the linear case (f(y) = cy) the scheme (9.97) 
reduces to the first order upwind scheme, if for F the Engquist-Osher flux 
(9.82) or the Godunov flux (9.95) is chosen. 


Exercise 9.4.9. For what reason can the proof that the Engquist-Osher 
scheme is monotone (Example 9.4.7) not be extended to the scheme (9.81)? 


10. The Euler equations in one space 
dimension 


10.1 Introduction 


Since almost all computing methods for the Euler equations in two and 
three space dimensions rely heavily on techniques developed for the one- 
dimensional case, we devote a full chapter to the one-dimensional Euler equa- 
tions. In the one-dimensional case many interesting analytic aspects can be 
brought to light, and this we do first. The shock tube problem is a very useful 
test. problem for discretization schemes; we will present its analytic solution. 
Then we turn to discretization methods. 


More extentive introductions to numerical methods for the Euler equation- 
sare given by Godlewski and Raviart (1996), Kroner (1997), Laney (1998), 
Majda (1984), Toro (1997), Smoller (1983), Hirsch (1990). 


10.2 Analytic aspects 


Useful books about analytic and physical aspects of the Euler equations are 
Liepmann and Roshko (1957) and Landau and Lifshitz (1959). 


In one dimension, the Euler equations (1.76)—(1.78) become, without heat 
addition and body force, 


p m 
U+fU)e=0, U=| m |, f= m?/p+p(p,e) |, (10.1) 
pE mH 


where m = pu with u the flow velocity, E = e + tu?, H = h+ tu’, 
h = e + p/p. We assume a perfect gas, and recall that in this case we 
have p = pRT, e = e(T), c, = de/dT. Hence h = A(T), cp = dh/aT, 
R = cp — ĉ&s, p = (y — l)pe, y = Cp/ey. These relations will be used 
frequently. 


The difference with the preceding chapter is, that now we have a system of 
conservation laws. 


398 10. The Euler equations in one space dimension 


Homogeneity of the flux function 
Suppose the equation of state is given by 


p = pgle) . 


Note that the perfect gas law is a special case, since e = e(T). For the flux 
function f we can write 


U2 
fv) = (02/0, tUg) } (10.2) 
U2U3/U; + Vrg(e) 


Since e = e(U) = (U3 — $U3/U1)/U1, we have have e(pU) = e(U), with p an 
arbitrary real number. Then (10.2) shows that 


f(wU) = pf (U). 


A function with this property is called homogeneous of degree one. Differen- 
tiation with respect to p and taking u = 1 results in the following interesting 
consequence: 


where f is the Jacobian of f, i.e. the matrix with elements ô fp /0U;. 


The Jacobian of the flux function 
If U, exists, equation (10.1) can be rewritten as 

U, + f (UU, =0. (10.3) 
From Sect. 2.2 we recall that the system is hyperbolic if f has a full set 


of eigenvectors. We will therefore solve the eigenproblem for f. Assume a 
perfect gas, so that 


p = (7 — 1)(U3 - 5URIs) , 


and f is easily found to be 


| 0 1 0 
f= 1 y? (3 — y)u y-1 
(y— lju? — yuk yE-3(y~-1)u? yu 


For later reference, we note that f can also be written as 
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0 1 0 
f=| tv (3-y)u y-1]. (10.4) 
thie ult H- O2 yu 
This follows from the identity 


—1 
yE = H+ u?. 


Transformation to nonconservative variables 


In order to facilitate solving the eigenproblem for f we transform to new 
dependent variables, namely 


p 
W=|[u]. (10.5) 
p 
We have 
Wi 
U= Wi W2 . 
sa Ws + ¿Wi W2 


The Jacobian Q = 0U/0W and its inverse are found to be 


Equation (10.3) can be rewritten as 
QW: + f QW, =0. (10.7) 


Premultiplication with Q71 gives the following nonconservative form of the 
Euler equations: 


Wi+fW.=0, (10.8) 
~i 1;' u P 0 
f =Q7fQ=|0 u 1/p |, #=yp/p, 
0 pe? u 


where c is known to be the speed of sound waves (see Sect. 1.12). The eigen- 
values of f (and hence of f ) are easily found to be 


Ay=u-—c, A»=U, Ag=ute. (10.9) 
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° . aft 
The corresponding eigenvectors of f are 


` 1 —p/e ` 1l ` 1 p/e 
Ri = = 1 , Ro= [0], R=-| 1 |. (10.10) 
2 2 
—pe 0 pc 


These are linearly independent. Hence, the system is hyperbolic. If the eigen- 
values are distinct (as in our case), the system is also called strictly hyper- 
bolic. 


Characteristics and Riemann invariants 


Define the matrix R by choosing its columns to be Ry 2,3: 


We find 


g 0 1 —1/pe 
R= | 10 -1/e? |. (10.11) 
01 


Premultiplication of (10.8) by R71 gives: 


1 
ur + (u = c)us — zz {Pe + (u—c)ps} = 0, (10.12) 
1 
Pt + Upe — (Pe + Ups) = 0, (10.13) 
1 
te + (u + c)us + Tipe + (u+ e)p} = 0. (10.14) 


The entropy is defined by relating small changes in the entropy (here to be 
denoted by S) to small changes in e and p by (1.50): 


TóS = de + på(1/p). 
For a perfect gas this can be rewritten as 


óS = e&ôln > — Rólnp = cô lnp — cpô In p = cô In pp? . (10.15) 


This defines S up to a constant, which is chosen such that, assuming c, = 
constant, 


S=cy,|Inpp 7. (10.16) 


Furthermore, continuing with (10.15), 
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You, P Yeu , l 
65 = (L êp ~ fp) = 6p — 6p) . 10.17 
7 Co p— 6p) 7 (5 p— dp) ( ) 


Hence, using (10.13), 


Yeu 


1 
S:+uS, = F {za (Pt + ups} — pi — ups} = 0. 





Therefore, as long as the unknowns are differentiable, the entropy S is con- 
stant along particle paths. When a shock is crossed a fluid particle undergoes 
an increase in entropy. If we assume that we are locally in a region where 
the particle paths carry the same entropy, so that S is constant locally, then 
equations (10.12) and (10.14) can be rewritten as follows. From (10.16) it 
follows that 


p=kp', k=exp(S/c,). (10.18) 
Hence c? = kyp’—! and if S, hence k, is constant we have 


1 2 
—dp = — 
pe y—-l 





This means that (10.12) and (10.14) can be rewritten as 











Hence, we have found that u +2c/(y-— 1) is constant along the characteristics 
defined by 


dr/di=utc, (10.19) 
and S is constant along characteristics defined by 
dx/dt=u. (10.20) 
The quantities u + 2c/(y — 1) and S are called Riemann-invariants. As we 
shall see, these are useful for constructing analytic solutions. 
Eigenvectors of Jacobian 


For later reference, we write down the eigenvectors Rp, p = 1,2,3 of the 
untransformed Jacobian f (U). They follow from R, = QR,: 
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1 
Ri = łp 1—u/e R = u , 
(10.21) 


Let R be the matrix with columns Rj, Re, Rz. For later reference, we deter- 
mine R71. We have R = QR, hence R7! = R~1Q7!. From (10.11) and (10.6) 
we find: 


-304 1$) F040 - D2) -O- Dos 
R= jg W-)DS -0-14 (10.22) 
—F(1- rie) 1 —(y-1)3) 0-1) 


Boundary conditions 


As in Chap. 8, boundary and initial conditions that lead to a well-posed 
problem for the Euler equations are precisely those that determine implicitly 
or explicitly the Riemann invariants. As in Chap. 8, the general rule is, that 
the number of boundary conditions in a point on the boundary must be equal 
to the number of incoming characteristics at that point. Let U(0, z) be given, 
and let the domain be x € (0,1). Consider the boundary x = 0. If u > 0 and 
u +c > 0 (supersonic inflow) three conditions must be given that determine 
U(t,0). If u > 0 and u—ec < 0 (subsonic inflow), two conditions must be 
given that determine the Riemann invariants corresponding to dr/dt = u 
and dz/dt = u + c, ie. S and u + 2e/(y — 1). Ifu < 0 and u+e > 0 
(subsonic outflow), then u + 2c/(y — 1) must be given. If u < 0 and u + 
c < 0 (supersonic outflow) then no boundary condition is to be given. The 
numerical implementation of these conditions is not straightforward, since in 
general u, S and c are not used as unknowns, but p, u and e. A good approach 
in practice is to prescribe u and e at an inflow boundary, and the pressure p 
at a supersonic inflow boundary and at a subsonic outflow boundary. 


This leaves us with the following difficulty. In many numerical schemes the 
density is required at an inflow boundary, since together with the inflow 
velocity this determines the mass flux. However, as just said, at a subsonic 
inflow boundary we prescribe only the temperature; the pressure and the 
density have to follow from the solution. This difficulty may be resolved by 
taking the boundary density from a previous time step or iteration, in a time 
stepping or iterative solution procedure. 
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Shocks and Rankine-Hugoniot conditions 


As we saw in the preceding chapter on the scalar case, when characteristics 
intersect, a shock is formed. Obviously, the characteristics of type (10.20) 
cannot intersect, since they are particle lines. But the characteristics of type 
(10.19) can intersect, so that shocks may be expected. Weak solutions may be 
defined in the same way as for the scalar case (Sect. 9.4), and exactly in the 
same way as in Sect. 9.4 it can be shown that the following jump condition 
must hold across a shock: 


s[U] =[F(U)], (10.23) 


where s is the speed of the shock and [-] is the size of the jump. This gives, de- 
noting quantities immediately to the left and right of the shock by subscripts 
1 and 2, respectively, 


8(p2 — pı) = m- m, (10.24) 
s(mz — mi) = m3/p2 + p2 — mi/pi — Pr, (10.25) 
8(p2 Ez — pı Eı) = mz tHo — mı Hı . 


These are known as the Rankine-Hugoniot conditions (Hugoniot (1889), 
Rankine (1870)). 


Several useful properties and relations can be derived from these conditions; 
see Liepmann and Roshko (1957) (Sect. 2.13) for more details. We start with 
the case of a stationary shock: s = 0. We have mz = mı. Dividing (10.25) by 
mı = m2 we get 


Uy — ug c? /yu2 — c? jyu . (10.26) 
Since h = cpT = c?/(y — 1), Hı = H; can be rewritten as 


1 1 1 i 
gui + Sy = gta t ye: (10.27) 








The sonic condition is defined as the condition in which u = c, and is denoted 
by an asterisk. Hence (10.27) can be written as 


Lea 
gut y-1 


1 1 1 
ci = 5+ 2 Ytl 


y—-1? y-1° . 





(10.28) 


Solution of cı and c3 from (10.28) and substitution in (10.26) gives after some 
manipulation 


urug = c2. (10.29) 


This interesting relation (called the Prandtl-Meyer relation) shows that either 
|ui| > c, and |u2| < cx, or vice-versa, so that the flow at one side of a shock 
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is always supersonic (|u| > c) and at the other side subsonic (|u| < e). (In 
more space dimensions, this remains true for the velocity component normal 


to the shock). 


We now show that the ratios of all quantities depend on only one parameter, 
for which we first take the Mach number Mı = u1/cı. Defining M* = u/c’, 


we get from (10.28): 


. 2 (M*\* y+: 
M$? + (7) = 7 5 





y-T\Mi y-1 
hence 
My” = (y+ DMi/{(y- 1)Mj +2}. 
We have 
ui Ung #2 
us ujuy $? 


so that (10.30) gives 


wy = (Y+ MA0 — IMG +2}. 


From mı = mz it then follows that 


PY = {(y - 1)M? + 2}/(y + DM2. 


p2 
From (10.25) with s = 0 we obtain 


P2 — Pi = piuilur = uz) ’ 


OT 


2 
P2 Pry u2 2 

— — | = —— (1 - —) = yMi(1 - —). 
pı Pi ( a? Y i ny? 


Substitution of (10.31) gives 


p2 2y 2 
| = —- (Mi - 1), 
pı 771l 1—1) 
or 
Pi _ yt+1 


p2? 1—y+2yM?- 


(10.30) 


(10.31) 


(10.32) 


(10.33) 


We see the sizes of the jumps depend only on one parameter Mı. Solving 
Mj from (10.33), we can also express the jumps in p and u in terms of the 


parameter p;/pz. The result is: 
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1 
uz Pl aP+1 P2 





=I: 


In the case of a moving shock (s # 0} there are two free parameters. This 
case can be obtained from the above case of a stationary shock by a Galilean 
transformation £z —> x — st. That is, we switch to an observer who travels 
with speed —s, and who sees the shock move with speed s. The thermody- 
namic quantities do not change under this transformation, but u; and uz 
have to be taken relative to the moving shock, so that u, and uz have to be 
replaced by ui — s and uz — s in the jump conditions above: 


uj—s Pta pa PL 


= =—. 10.35 
ug — § aP+1’ Pa (10 ) 





We now have two free parameters, for which we can take P and s. Alterna- 
tively, s can be obtained from (10.35) as 





Another expression for the shock speed can be obtained as follows. For the 
case of a stationary shock we have (10.33), which can be rewritten as 
y+1 


M? = 14+ (B 1), 10.36 


with M? = u?/c?. For a moving shock this becomes M2 = (u1 —s)*/c?. Hence 


_ y+1l m 
(s — wuy = c?{1 + a p — 1)} . (10.37) 


Note that subscript 1 refers to the low pressure side of the shock. 


The entropy condition 


As in the scalar case, discontinuities do not only have to satisfy the jump 
condition (10.23), but also the entropy condition, in order to be physically 
realizable. In the present case we can check the behavior of the entropy di- 
rectly, and require that the entropy of a fluid particle does not decrease as it 
traverses a shock. From (10.16) we have 

l y 


l — 
m y-1 








S=c,Inp—e,lnp= R{ In p} , 


hence 


-1 
J (S2- Sı) =m yin. 
R pı pı 
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First, consider the case of a stationary shock. Substitution of (10.32) and 
(10.33) and writing M? = 1 + € gives 





-1 
1 (Sp — S1) =In(1 + 21) — yin(1 +€) + yln(1 + 22) , 
where 
2+ ¢ y-—1 
zi = ——E, = —— 
to ytd) F yl 


Expansion for |e} < 1 gives 


y-1 


= (S2~ $1) = 277 = Des 4 ety, 


3(y +1)? 

Now assume wu, > 0, and let subscript 1 denote the state immediately to the 
left of the shock. Hence, before entering the shock a fluid particle has entropy 
Sı, and immediately upon leaving it has entropy S2. The entropy condition 
requires that Sz > Sı. Hence we must have e > 0, or 


Mj>1. (10.38) 


We see that the entropy condition is equivalent to the statement, that a fluid 
particle can only enter a shock from the supersonic side. (We saw earlier that 
a shock always separates a supersonic and a subsonic state). In the case of 
a moving shock this remains true for the velocity of a fluid particle relative 
to the shock. This means that an observer that moves with a fluid particle 
with the flow velocity u sees a shock coming on always at supersonic speed. If 
this is not the case, transition from one state to another does not take place 
through a shock but through a contact discontinuity or an expansion wave. 


Equations (10.38) and (10.36) imply pz > pı. Hence a fluid particle that 
crosses a shock never undergoes a drop in pressure. 


Contact discontinuities 


Since the characteristics of type (10.20) (particle paths) cannot intersect 
themselves, they do not give rise to shocks. But since the corresponding 
Riemann invariant S can be different for different particle paths, they can 
support a jumpin S. The discontinuity travels with speed 


8 = Ui =U), (10.39) 


with subscripts 1 and 2 denoting the states adjacent to the discontinuity. It 
is clear that a fluid particle cannot cross this type of discontinuity, and two 
bodies of fluid that are initially in contact at the discontinuity remain so at 


10.2. Analytic aspects 407 


later times; hence the name contact discontinuity. Substitution of (10.39) in 
the Rankine-Hugoniot conditions (10.24), (10.25) gives 


P2— Pl = P2—- Pl, 
s*(p2 — p1) = 3 (p2 — pi) + p2 — Pi - 


It follows that pı = pz, but p can jump. The corresponding jump in entropy 
follows from (10.16): 


S2 — Sı = cp In pi/ pe . 


Simple waves and expansion fans 


A simple wave is a part of the domain where the solution is smooth and where 
S and one of the other two Riemann invariants are constant. For example, 
let 

2c 2C1 


S= $, u + = ü + 


-T =i (10.40) 








Because the solution is smooth we have 
2c 
y— 1 





dx 
u — = constant along HUTS: 


Therefore along a characteristic u + 2c/(y — 1) = constant, and hence both 
u and c are constant. As a consequence, the characteristics are straight in a 
simple wave. 


A simple wave which is a similarity solution, i.e. a solution of the form 
U=U(g), €=2/t 


is called an expansion fan. We will give an explicit solution. Suppose to the 
left of the expansion fan we have a constant state, denoted by subscript 1: 


U(gy=Ur, E<h. 
Because u and c are constant along dz = u—c, we have along a characteristic 
u-—c=2x/ft. (10.41) 


By solving u and c from (10.41) and (10.40) we get 








u 2 Cta) + H 
z — | — — u 
yL VT yI? 
y-1 £ 2 
c= — = . 
yy p+ oye 
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By using £; = uy — cı this can be rewritten as 





y-1 
1 


€-&), coal = oS 


u — ui = =a (£1 — £) . (10.42) 


Pressure and density can be determined as follows. Because S = constant, it 
follows from (10.16) that 


pp’ =p- 
From this and from c? = yp/p it follows that 
(eja)? = (p/p1)°- 9 = (p/p)? (10.43) 
Substitution in (10.42) gives 


P sp4 e-a, 2 = (14 -8/0 


Pı 7 
(10.44) 








Suppose the expansion fan separates two constant states (denoted by sub- 
scripts 1 and 2). Then the relation between these states can be expressed 
with po/pi as parameter as follows. Equation (10.43) gives 


co/ey = (po/pr)O-Y!7 , p2/p1 = (p2/pi)/7 . (10.45) 


Equation (10.40) gives 





U2 — Uy = (cy — c2), 


-1 


which by using (10.45) can be rewritten as 





2 
ua — wy = ver{l — (p2/p1)07 D271} (10.46) 


The Riemann problem 


The Riemann problem consists of solving the one-dimensional Euler equations 
for a special class of initial conditions, namely a constant (left) state for 
—oo < r < 0 and a different constant (right) state for 0 < z < oo. The left 
state will be indicated by subscript 1 and the right state will be indicated by 
subscript 4. So the initial conditions are: 


u(0, £) = uy, p(0, xz) = pı, p0, z) = pi, <0; (10 47) 
u(0, £) = u4, p(0, £) = pa , p(0, £) = pa, z>Q0. l 


For the time being we take u4 = 0. If also u; = 0, then the problem is called 
the shock tube problem. In Sect. 18B of Smoller (1983) it is shown that for 
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the Euler equations, the Riemann problem always has a solution (if p1,4 > 0). 


It is easy to see that if U(t, x) is a solution of (10.1), with initial condition 
(10.47), then so is U(at,ax), with a constant. Therefore the solution is of 
similarity form: 


U=U(g), €=2/t. 


We restrict ourselves to the case pı > p4. We will use the following building 
blocks to bridge the gap between states 1 and 4 for t > 0: 


(i) a shock separating states 4 and 3; 
(ii) a contact discontinuity separating states 3 and 2; 
(iii) an expansion fan separating states 2 and 1. 


The situation is sketched in Fig. 10.1. If we succeed in solving the Riemann 
problem with these building blocks, then the structure of the solution as- 
sumed here is correct. 


We start with the shock. Equation (10.37) becomes, with appropriate changes 
of indices, 


(s — u4)? = {1+ G(r}, P = ps/ps- 


We assume p3 > p4, otherwise the entropy condition (10.38) would be vio- 
lated, according to (10.36). As noted before, a fluid particle that traverses a 
shock cannot undergo a drop in pressure, hence s > u4, so that 


-1 7+1 
s= u + eal +IP. (10.48) 





Equation (10.35) gives, with u4 = 0: 
P Pt+a a 
= s(1 — =(P -1 -1 PY, 
u= s(1- ZES) Žo -1+ UP 
For the density we have equation (10.34): 
p3 l+aP 


p a+ P 





+ 


Next, we consider the contact discontinuity separating states 3 and 2. We 
have 


u2 = U3, P2=P3, P2 + P3 - (10.49) 
The contact discontinuity moves with speed uz = u3. 


Finally, we consider the expansion fan separating states 1 and 2. Equation 


(10.46) gives 
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e1{1 — (p2/pr) 7} (10.50) 





2 
uz = u1 + 
y— l 
Equating uz and uz and pz and ps gives a nonlinear equation for P = p3/p4 
in terms of the known pressure ratio: 


sherni- (y — Iea(P - 1) yam (10.51) 


2 er eqy/2y(y-14 (y+ DP) 


With u = 0, this is the basic shock tube equation. After P has been deter- 
mined numerically, the solution can be constructed. It is assumed that (10.51) 
has a real solution for P. For this it is necessary that the term between braces 
is positive. If this is not the case, then the solution has a different structure 
than assumed. This is also the case when p4 > pı. We will not go into this 
further, and assume that states 1 and 4 are specified such that (10.51) has a 
solution. 


When P has been obtained, equations (10.50) and (10.49) give 





2 -1 
uz = u3 = u1 + 72 red — (Ppa/pi) F } . 
Equations (10.45) and (10.34) give 


l+aP 
uy ps = Pa 





p2 = pı(Pp4/p1) 


The speed of sound follows from c? = yp/p. The shock moves with velocity s 
given by (10.48). The contact discontinuity moves with speed uz = u3. The 
left and right sides of the expansion fan move with velocity 


fi = c1, 2 =u2-c. 
The variation of u, p and p in the expansion fan is given by equations (10.42) 


and (10.44). Fig. 10.1 gives a qualitative picture of the flow. 


We have constructed a weak solution that satisfies the entropy condition. It 
can be shown that such solutions are unique; see for example Taylor (1996) 
Chap. 17. 


The case u4 Æ 0 is easily handled by a Galilei transformation. Define new 
velocities: 


Uy = Uy — U4, üa = U4 — Ug — 0. 


The Riemann problem for (ŭ, p, p) can be solved in the way outlined above, 
provided the data are such that a solution of the assumed structure exists. 
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X 


Fig. 10.1. Shock tube problem: z — t diagram of shock (—), contact discontinuity 
(- - -) and expansion fan. 


After the solution has been obtained one puts u = u + u4. 


We will not present the general solution of the Riemann problem. The gen- 
eral solution can be obtained in the same spirit as above, but more possi- 
bilities have to be considered. For instance, there could be two shocks, or 
none; or there could be two expansions fans, or none. See Sect. 93 of Landau 
and Lifshitz (1959) for the various possibilities. Approximate solutions of the 
Riemann problem play a dominant role in an important class of numerical 
methods for the Euler equations, as we will see. 


The shock tube problem is a special kind of Riemann problem. Its solution is 
in the class of solutions that we have constructed above. We have an infinitely 
long tube along the z-axis, with a diaphragm at x = 0, which separates two 
bodies of gas at rest (i.e. uj = u4 = 0). We assume here that gas is the same 
at both sides of the diaphragm, but that p and p are different. The shock 
tube problem consists of determining the flow that develops when at t = 0 
the diagram is suddenly removed. 


A well-known test problem is the shock tube problem proposed by Sod (1978). 
The initial conditions are given by 


uw=0, p =l, =l, u4=0, pa = 0.1, pa = 0.125. 


The solution can be determined in the way described above, and is presented 
in Fig. 10.2. Going from right to left we observe a shock, a contact discontinu- 
ity and an expansion fan. In this and the following figures, the quantity called 
entropy is Inpp~7, which is in fact proportional to the physical entropy (cf. 
(10.16)). There can be a jump of either sign across a contact discontinuity, 
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Fig. 10.2. Solution of Sod’s shock tube problem at t = 0.15 
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Fig. 10.3. Solution of test case of Lax at t = 0.15 
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depending on the choice of the left and right states of the Riemann problem. 
Fluid particles do not cross a contact discontinuity. Figure 10.2 shows that 
the entropy of fluid particles is constant in time, except when they cross a 
shock, in which case the entropy increases. 


Some publications showing numerical results for this test case are Sod (1978), 
Harten, Engquist, Osher, and Chakravarty (1987), Arora and Roe (1997). 
Another frequently used test problem is the Lax test case, see Lax (1954), 
Harten (1983) and the papers just quoted. The initial state is specified by 


ui = 0.698 , pı = 3.528 ; PI= 0.445 , u4 = 0 » paz 0.571 » fat 0.5. 


The solution is shown in Fig. 10.3. The contact discontinuinity and the shock 
are stronger than in Sod’s shock tube problem. 


In these two test cases the flow remains subsonic. Supersonic flow may bring 
additional numerical difficulties. A test case in which supersonic flow occurs 
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Fig. 10.4. Solution of Mach 3 test case at t = 0.09. 


is considered by Arora and Roe (1997), and will be called the Mach 3 test 
case. The data are: 


uy, = 0.92, pı = 10.338 , pi = 3.857 , ug = 3.59, pa=l, pa=l1. 
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The solution is shown in Fig. 10.4. The flow is dominated by a strong expan- 
sion fan. 


10.3 The approximate Riemann solver of Roe 


The usual discretization method for hyperbolic systems of conservation laws 
is the finite volume method. A number of well-known finite volume schemes 
will be presented. Treatment of boundary conditions is postponed till our 
discussion of the three-dimensional case in Chap. 12. 


For more information on finite volume methods for hyperbolic systems of 
conservation laws and the Euler equations in particular, see Godlewski and 
Raviart (1991), Hirsch (1990), Kroner (1997), Laney (1998) and Toro (1997). 


Conservative schemes 


Just as for the scalar case discussed in the preceding chapter, convergence to 
a genuine weak solution can be guaranteed only if the scheme is conservative. 
Therefore we will discuss only conservative schemes, i.e. schemes that can be 
written as (assuming a uniform spatial mesh size h): 


1 
dU; /dt + > (Fj+i/2 — Fj-1j2) = 0, Fj4ay2 = FU j-p, Uj-p41y © Ujta) » 
(10.52) 


where the temporal discretization can be chosen as one likes. Integration of 
the Euler equations (10.1) over a cell results in (10.52) with Fj41/2 given by 


Fy41/2 = F (Uj 41/2) (10.53) 
and 
Tji+1/2 
U; (t) = i U(t, x)dz. 
T5-1/2 


Requiring the spatial discretization to be exact if U = constant, comparison 
of (10.52) and (10.53) results in the following consistency condition: 


The Godunov scheme 


The Godunov scheme has been described in Chap. 9 for a single conservation 
law. Its principle extends to systems of conservation laws; in fact, it was orig- 
inally developed by Godunov (1959) for the Euler equations. The Godunov 
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scheme is described in Sect. 12.15 of Richtmyer and Morton (1967). 


Suppose that at time t = t” we have available the values U? in the cell 


centers. These values are extended to the whole cell, resulting in a piecewise 
constant function U(t”, x). This is used as initial condition to solve 


aU, af) 


ot Ox =0 





for t > t”. Fort- t” < Ž max [A|, with À ranging over the set of eigenvalues 
of the Jacobian of f, the exact solution consists of solutions of local Riemann 
problems at the cell interfaces. For ¢ larger the waves emanating from the 
cell interfaces start to interact, and the solution is no longer composed of 
Riemann solutions. The Riemann problem at the cell interface x = 2541/2 
has left state U? and right state Uj',,. Denote the solution of the Riemann 


problem by U (ane, UF, Ur). For t — tn small enought we have 


“~ 


F(U(t, ej 44/2)) = FU (0; UF, UF ,1)) - 


In the Godunov scheme this is used for the numerical flux function Fj 41/2 in 
the semi-discretized system (10.52): 


Fy41/2 = F(Uj,Uj41) = f(Û (0; UF, UF, ,)) > (10.55) 


Thus, evaluation of F; 41/2 requires the exact solution of a Riemann problem. 
The Riemann problem for the Euler equations has been solved for the shock 
tube problem in the preceding section. Its general solution is somewhat com- 
plicated and will not be discussed. 


Because at every time step the solution is approximated by a piecewise con- 
stant distribution, the Godunov scheme is not exact. Therefore one may as 
well replace the exact solution of the Riemann problem in (10.55) by a simpler 
approximate solution. This has given rise to a class of discretization methods 
called approximate Riemann solvers. These are the subject of the present 
section. As a preliminary, we first discuss the Riemann problem for a linear 
system. 


The Riemann problem for a linear system 


Consider the following Riemann problem: 


ðU ðU mxm 
= t z 78> AER constant , 


U(0,«4)=U,, «<0; U(0,z)=Ur, z>0. 


(10.56) 
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It is assumed that A has m linearly independent eigenvectors Rj,..., Rm with 
corresponding eigenvalues À} < Àz < ... < Am. Define a1,...,am implicitly 
by 


Ur -Ur =) aR, (10.57) 
p=1 


We diagonalize (10.56) in the usual way. Let R be the matrix with columns 
Rj, ..., Rm. Then (10.56) can be rewritten as 


ow ôW i , 
g tip TO W=R U, A = diag(Ay,...,Am) , 


W(0,2)=Wr, 2<0; W(0,2)=Wr, 2>0. 
Writting W = (w1,..., Wm), the solution is, with E = x/t: 
we(§)= wer, E<AR; we(E)=wRe, EDA, k= 1,...,m, 


where Wz and Wp are determined as follows. Let 


UL = > Rp, Ur= Rp- 
p=1 p=1 


From W = RIU it follows that 


Wi = (A1, sey Bm)? ; WR = (v1, nym)? ' 


The solution follows from U(E) = RW (€). We find: U(E) = Uz, € < Ai; 
U (£) = Ur +a1ıRı, A1 < E < Ao; etec., resulting in 


U(§) = UL, E< Ài; 


k 

U(E) = Urt+ do apRp, Ae <E< Akp, k=1,...,.m—15(10.58) 
p=1 

U(E) = Urt+ > apRp =UR, Am <E. 
p=1 


The solution is seen to be piecewise constant. It is left as an exercise to show 
that the jump at x/t = A, satisfies the jump condition (10.23), verifying that 
(10.58) is the correct weak solution. 


The Roe scheme 


The exact Riemann problem is: 
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au, af) 
at Ox 
U(0,2)=U,, x<0; U(0,z)=Ur, «>0. 





= 0 
, (10.59) 


Roe (1981) approximates this by the following linear Riemann problem: 


dU ðU mxm 
EI + A> =0, AER constant, 


U(0O,c)=Ur, «<0; U(0,z)=Ur, z>0. 


(10.60) 


The matrix A depends on Uz and Up: A = A(U1L,UR), and is assumed to 
satisfy the following conditions: 


f(V)-f(W) =A(V,W)\(V - W), (10.61) 
A(V,W) — f (V) a WV, (10.62) 
A(V,W) has only real eigenvalues, (10.63) 
A(V,W) has a complete system of eigenvectors. (10.64) 


Condition (10.61) ensures that the exact solution of the original Riemann 
problem (10.59) is obtained in the particular case when the solution consists 
of a single shock or contact discontinuity. Condition (10.62) ensures con- 
sistency with the original equation. Conditions (10.63) and (10.64) ensure 
solvability of the lmear Riemann problem (10.60). 


Roe’s scheme is given by 


du; 1 
P7 + z (Fi+t/2 — F5-1/2) = 8, 


F541/2 = F(U}, Uj41) = Aj4ija(Uj, Uj41)U (0; Uj, Uj 41) > 


(10.65) 


where U is the exact solution of the linear Riemann problem (10.60) with 
Uz = U; and Ug = U;41 , and A is now named Aj41/2. We call Aj41/2 the 
Roe matrix, and F;41/2 as defined in (10.65) is called the Roe flux. 


A useful expression for the Roe flux can be derived as follows. From the exact 
solution of the linear Rieman problem (10.58) it follows that we have for U 
in (10.65): 


Ù =U, Amn <0; U=U;, >0; 
whereas in the case that for some k we have Ay < 0 < Aga) 
m 


. k 
U = Uj + $ apRp =Uj4i— 2 op Rp 
1 p=k+1 


p= 
k m 
= (U; +U) +31- b japR,- 
p=l p=k+l1 
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Hence 
Fj41/2 = Ajgay2U = Aj41/2(Uj + Ujjs) - 4 2, |Aplap Rp 
p= 


r (10.66) 
= 3{f(U;) + F(Uj41)} - 2 Aplap Rp , 


where we have used (10.61). In this expression we may obviously allow A; = 0; 
this has to be excluded in (10.58) and hence in the derivation leading up to 
(10.66), because it makes no sense to write U(0) if Ù (ê) has a jump at £ = 0, 
but the flux is perfectly well defined (namely zero) in this case. Noting that 
according to (10.58) 


Sap Rp = Uj41 — U; , (10.67) 
p=l 
we see that the Roe flux satisfies 
Fyasy2 = f(Uj), AL >20; Fyaaye = F(Uj41), Am <0, 


so that the Roe scheme is identical to the first order upwind scheme in these 
cases. 


A matrix Aj4,1/2 satisfying Roe’s conditions (10.61)-(10.64) can be con- 
structed as follows. For the Euler equations (10.1) the state and flux vectors 


P pu 
U=]| pu], f= | pur+p |, 
pE puH 


where H = E + p/p, are expressed as functions of the vector 


1 
Z=/p| u 
H 


as follows. Assuming a perfect gas, the equation of state gives 


p= (y—l)pe = (7 ~ I)(PE — sou’), 





so that 
y—1 1,4 1 y-1 4 
p= ——— (pH — -pu*), pE = -pH + pu’. 
y 2 y 2y 
We find 
2124 2142 
U= 2122 , f= 1E z222 + 1 2123 


1 yzl 
> 2123 + gy 7222 Z223 
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Observe that the elements of U and f are homogeneous quadratic functions 
of the elements of Z. We define the difference da = aj, — a; and the average 
a = $(a; + .a;41). Then we have d(ab) = āôb + bôa, ada = 56(aa). Using 
these identities, the following equalities are easily verified: 


i 2z, 0 0 
6U=Bé6Z, B=| ž z1 0 ; 


0 23 Zə 
Hence 
CB-'6U = CôZ = ôf , 
so that 


Aj41/2 = ĒB-! 


satisfies condition (10.61). We find: 


0 1 0 
Aj41/2 = ear o B-72 , v71 
(SB) - BR aoo ye 


Define the following averages: 
vPitj + yPjtitj+i p _ yPiHi + VPj+t A541 (10.68) 
JPi+ Pit VPi + yPi+i 


These are called the Roe averages. It is easily seen that the Roe matrix can 
be rewritten in terms of the Roe averages as 


0 ] 0 
Aj41/2 = i? (3-yju yl]. 


ue aH H-(y-l)u yü 


u = 


Comparison with (10.4) shows that Aj+41/2 equals the Jacobian evaluated at 
the Roe-averaged state (10.68): 


Ajaij2=f (U). (10.69) 


This immediately establishes that A;11/2 satisfies the properties (10.62) 
(10.64). Roe and Pike (1984) show that for the Euler equations the matrix 
Aj41/2 satisfying conditions (10.61)—(10.64) is unique. 
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For the Roe flux we have the expression (10.66). Because of (10.69), Ap 
follows immediately from (10.9): 


Ay=tut-—C€, ASU, à= u+, 
where ¢ is the sound speed associated with u and H: 
- 1 
č = (y-1)(H — 5%) . (10.70) 


It is to be noted that ¢ is not a Roe-average of c; and cj41. The eigenvectors 
Rp follow from (10.21). For simplicity we rescale Ry and R3; Rı is multiplied 
by —2c/p and R3 is multiplied by 2c/p. This gives, using (10.70), 


1 1 1 
R= u — £ , R= u , Ra z= u + 

H — ié tu? + 
The coefficients a, in the Roe flux (10.66) follow from (10.67) as follows. Let 


R be the matrix with columns R1, R2, R3. The rows of R~! (let us call them 
R}, R?, RÌ) constitute an orthogonal system with the columns of R: 


R. R= 6, 


with 6! the Kronecker delta. Taking the inner product of (10.67) with R? 
gives: 


aq = RY (Uj44 — Uj) . (10.71) 


The vectors R!, R?, R? equal the rows of (10.22), scaled appropriately: the 
first row is to be multiplied by —p/2c and the third by p/2c. This gives: 


R! 


(že+0-95), -z0 +0- )Ī), 13). 


—] g? u 
R= (1-125, -03, --04). 
R= (-5e-0-95, =a -0-)), 13) . 


This completes our description of how to determine Àp: @p and Ry for com- 
puting the flux Fj41/2 for the Roe scheme according to (10.66). 


We end our description of the Roe scheme with deriving a useful reformu- 
lation. Let R be the matrix with columns Rp. Then the inverse R~! is the 
matrix with rows RP. It is not difficult to see that we have, with Mp given by 
(10.71), 


do plop Rp = Ro A|R(Uj41 — Uj) , 
p=l 
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where |A| = diag(|A;|,...,|Am|]. We have RAR = A, and write therefore 
LA| = R7"|A|R. This results in the following compact expression for the Roe 
flux: 


1 1 
Fj+1/2 = 5 fi) + f(Uj+1)} — 5 |As+a/2l(Ui+1 —U;). 


Numerical tests of Roe scheme 


We will apply the Roe scheme to the three test cases presented in the pre- 
ceding section. We discretize (10.65) in time by the explicit Euler method, 
and obtain 

1 . 

UPH = UF — AFi a Ea) A=T/h, j=2,...,J—1. (10.72) 
The exact solution is used to prescribe Uf and U7. The time step r and 
mesh size h are constant. A discussion of stability conditions on 7 is deferred 
to the next section. In all cases we take h = 1/48. Results are shown in 
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Fig. 10.5. Sod’s shock tube problem; exact (—) and numerical (ọ) solution with 
Roe scheme; t = 0.1458, à = 0.5. 
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Fig. 10.6. Test case of Lax; t= 0.15, A = 0.2. 
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Fig. 10.7. Mach 3 test case; t = 0.0875, A = 0.2. 
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Figs. 10.5-10.7. As explained in Sect. 10.2, the entropy jump acros the contact 
discontinuity is a function of the initial conditions. Across the shock the 
numerical entropy increases and in the expansion fan it is constant, as it 
should be, in Figs. 10.5 and 10.6. Because we have a first order scheme, the 
smearing of the contact discontinuities in Figs. 10.5 and 10.6 is to be expected. 
Near shocks, the characteristics converge; this produces a steepening effect, 
so that shocks are less smeared than contact discontinuities. In fact, shock 
resolution is very crisp with the Roe scheme. An important observation is 
that spurious wiggles are absent. In the test cases of Figs. 10.5 and 10.6 the 
accuracy is satisfactory for a first order scheme. For the Mach 3 test case 
(Fig. 10.7), however, the numerical solution is completely wrong. This is a 
beautiful example of violation of the entropy condition. The expansion fan 
is replaced by an expansion shock in the sonic point. According to (10.41), 
the sonic point in an expansion fan does not move. Hence fluid particles 
cross the sonic point from left to right. The figure shows they undergo a 
decrease of entropy, in violation of the entropy condition. This violation of 
the entropy condition in sonic points of expansion fans by the Roe scheme 
will be explained later. Note that the expansion fans occuring in Figs. 10.5 
and 10.6 do not contain a sonic point, and are approximated satisfactorily. 


When the expansion fan is less strong than in the Mach 3 test case, the Roe 
scheme does not necessarily violate the entropy condition. This is illustrated 
by the following shock tube problem, which contains a supersonic zone, and 
which we will refer to as the supersonic shock tube problem: 


pr=pL=8, pr=pr=0.2, up=ur=0. (10.73) 


The exact and numerical solutions are shown in Fig. 10.8. A small jump 
is visible near the sonic point in the expansion fan. This is called a sonic 
glitch. It looks like a small expansion shock, violating the entropy condition. 
However, the entropy is seen not to decrease as a fluid crosses the stationary 
sonic point, but to increase a little. Hence there is no violation of the entropy 
condition. We see that a sonic glitch may appear even if the entropy condition 
is satisfied. The lower part of Fig. 10.8 shows that the sonic glitch becomes 
smaller when the mesh size is decreased. But it is found that for the Mach 3 
test case mesh refinement does not help. We will now consider a remedy for 
this shortcoming of the Roe scheme. 


Sonic entropy fix for the Roe scheme 


As seen in Sect. 10.2, expansion fans are associated with A, or Àa. In order 
to analyze the way in which an expansion fan is approximated by the Roe 
scheme, let us consider the special case in which U;41; — U; is such that 
az = a3 = 0 in (10.57). If we also assume that 
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Fig. 10.8. Supersonic shock tube problem; t = 0.1562, 4 = 0.3, k = 1/48 (above), 
h = 1/96 (below). 
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hy =a-=0, (10.74) 


then equation (10.66) gives for the Roe flux 


Fj41/2 = LIAU) + f(U;+1)} - 


This means that central discretization is used, without artificial viscosity, so 
there is a danger that entropy decreases, and that discontinuities are insuf- 
ficiently smeared out. This is what we see in Fig. 10.7. But if (10.74) does 
not hold, i.e. the flow is not sonic, then we do not have central discretiza- 
tion, and there may be enough dissipation to prevent violation of the entropy 
condition. This is confirmed by the correct approximation of the subsonic 
expansion fans in Fig. 10.5 and 10.6. 


An often used artifice to add dissipation to the Roe scheme near sonic con- 
ditions has been proposed by Harten (1984). The eigenvalues À; and A3 are 
slightly increased in the vicinity of zero, replacing them in (10.66) by 


oI 


[Ap] if [à| >E, p=1,3; 
lp! 


(10.75) 
I(E + ¢) if |A| <€, p=1,3. 


for some small value of £. Since (10.75) comes only into play when |u| % c, 
this is called a sonic entropy fix. Fig. 10.9 presents numerical results for the 
supersonic shock tube test case, with € = 0.5. Compared with Fig. 10.8, 
the sonic glitch has almost disappeared. But the strong expansion shock of 
Fig. 10.7 is not completely replaced by an expansion fan by this sonic entropy 
fix, as shown in Fig 10.10, at least for € = 0.5; we think this is rather large 
value of c. Nevertheless, the entropy condition is now satisfied, as shown by 
the entropy plot. 


Summarizing, the Roe scheme performs well for flows without strong expan- 
sion fans. The sonic entropy fix of Harten cures the violation of the entropy 
condition, but significant sonic glitches may remain. 


Exercise 10.3.1. Show that the solution (10.58) satisfies the jump condition 
(10.23). 


10.4 The Osher scheme 


The Osher scheme (Osher (1981), Osher and Solomon (1982), Osher and 
Chakravarthy (1983)) can be regarded as a generalization of the Engquist- 
Osher scheme (9.82) from the scalar case to the systems case. In Sect. 9.4 we 
saw that the Engquist-Osher scheme is an upwind scheme that satisfies the 
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Fig. 10.9. Supersonic shock tube problem with sonic entropy fix; € = 0.5. 
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Fig. 10.10. Mach 3 test case with sonic entropy fix; € = 0.5. 
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entropy condition. This was seen to be not the case for the Courant-Isaacson- 
Rees upwind scheme. It turns out that the Osher scheme also satisfies the 
entropy condition. 


We first rewrite the Engquist-Osher flux as follows. Let p(y) be the Heaviside 
function: 


u) =1, p>0; p(y) =0, ¥ <0. 


Then the Engquist-Osher flux can be written as 


Baap = $0) + f as des | -S wear 
0 0 
= flet f Q- MDF Ode = Fes) + [ mint (0), 0dv 
We can also write 
Fy41j2 = f(pi+1) — J uF (bdh = Flp) - J max{f (4), 0}dy . 
Pa Pi 


Hence, we can write for the Engquist-Osher flux: 


Pipl 


Fj = sifle) + HeD f lf wiley. 


Pi 


This is generalized to systems by interpreting the absolute value of the Jaco- 
bian as any other matrix function, namely as 


If |= RIAIR? , 


where |A| is the diagonal matrix with elements the absolute values of the 
eigenvalues of f (given by (10.9)) and R the matrix with the eigenvectors 
(given by (10.21)) as columns. This gives the following generalization of the 
Engquist-Osher flux for systems: 


Uy 
Fip =5(fUo) +f} 5 f Wey, Q076) 
Uo 


where for brevity we have put j = 0; Uo is the state to the left and U: 
is the state to the right of the cell interface. The path is parametrized by 
U=U(c), 0<o0 <1, U(0) = Uo, U(1) = U1, so that (10.76) becomes 
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Fin = 5(fUa) + 10} f Oa qo) 


The integral depends on the path. In Osher’s scheme the path is chosen 
such that the integral is easy to evaluate and the computing cost is small. 
The path is divided in three parts 7; (0 < o < 1/3), Iy(1/3 < o < 2/3) 
and I; (2/3 < ø < 1). The integration path is chosen such that 


dU dU dU 
To 7 s on I) , To m on I, To 7 tt on I}. (10.78) 


This choice has the following consequences. Let R!, R?, R? be the rows of R=} 
(given by (10.22)), so that RP R, = 6° (Kronecker delta). Then it follows from 
(10.78) that we have on each subpath 


gd _ 


we 10. 
RI = 0 (10.79) 


for two values of q. We show that this implies that two Riemann invariants 


are constant on Ip. To this end we transform to the nonconservative variables 
W defined by (10.5), so that (10.79) becomes 


dw 
1Q— = 

R'Q do 0, 

where Q = 0U/AW is given by (10.6). We obtain 

R'QdW = du— Lap, 
pe 

2 1 p 

R’QdW = dp— -zdp = dS , 


R°QdW = du + | op . 
pe 


where we have used (10.17). On Tı equation (10.78) holds for q = 1,2, so 
that 


S = constant, du — zdp =. (10.80) 


This can be rewritten as follows. Since the entropy is constant, equation 
(10.17) gives with ce? = kyp?~! 





i 
—dp= 
pc 


and (10.80) becomes, using (10.16): 
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2c 
pp’ = constant, u- JTI = constant. 


Similarly, on T3: 
— 2c 
pp *=constant, u+ nr ie constant . 
y —_— 


Finally, on T% we have du + dp/pc = 0, so that on T> 
u = constant , p = constant . 


By means of these relations we can determine the end points of Ti, 3 and 
T in the following way. Defining 


z= In(pp”) , 


we can write the perfect gas law as follows, remembering that c? = yRT: 


e7 1 


p= plea) = V). (10.81) 


The conditions along the subpaths give the following relations between the 
states at the end points: 














= — = _ =W 
41/3 = 70, Ui1/3 y— jo1/3 uo — 1° 0; 
U1/3 = U2/3,, P2/3 = P1/3; 

= = =W. 
22/3 = Zi, Ua3at ya [o2/3 = “1 + 7 1% 1 


From p2/3 = p1/3 it follows that, using (10.81), 
€23/ erg = e72/2~ 71/90/29 = el1—20)/27 = ey Jeg =a. 


We obtain the following linear system: 








2 
u13 — y- 761/3 = Yo, u3 + 7 701/3 = P. 
The solution is 
— 1 alo +W 
ĉ1/3 = 00 Da (¥i- Yo), u3 = — i * (10.82) 


Since ¢;/3 connot be negative, a meaningful solution does not exist if Y, < Wo. 
This corresponds to u; < ug. We will not worry here about this uncommon 
case. 


Having determined the integration path, we now have to evaluate the integral 
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in (10.77). It turns out that for this we need to determine points on Ij 
where A; changes sign and on T3 where 43 changes sign (sonic points). Since 
A2 = u = constant on I2, Àz does not change sign on T>. It is a fact — not 
shown here — that for the Euler equations there cannot be more than one 
sonic point on I; the same holds for I3. Hence, a sonic point occurs on Ti 
if and only if 


(uo + co)(u1/3 +€1/3) <0. 





Let in the sonic point on T; (if any) u = uj, c = cj = —uj. Then we have 
y-1 
= Zo, uj = Y, ci = uy. 
0 0 1 y+1 O, "4 1 


Similarly, on T3 a sonic point occurs if and only if 
(ui — €1)(u2/3 — €2/3) <0, 


and satisfies 





We are now ready to evaluate the integral in (10.77). We denote the value of 
g on I, or Ts where a sonic point occurs (if any) by d1 or o3, respectively, 
and define the sign function by 


sign(A)=1, A>O; sign(A)=0, A=O; sign(A)=—-1, A<O. 
We have 


[ify = f If WIRedo = f [raiReae 
= sign(A3(0)) | AsRsdo = sign(As(0)) | f (U)dU 
| | 


= sign(A3(0))( fo, = fo) ’ 


where we abbreviate 


f(U(o1)) = for, F(U(0)) = fo. 


The remaining parts of the integral are evaluated similarly. Equation (10.77) 
results in the following flux for the Osher scheme: 


Fija = 5{1 + sign(As(0))} fo + 5 {sigm(As(5)) = sign (às (0))} fo, 


l.. . 1 
+5 (sign(u1/s) — sign(A3(— 


5 )) haya + 5{sign(Ax(3)) — sign(urys) boys 
2 
(3 


+5 {sign (1)) ~sign(a(S)) Hes + 5 {1 — signa (1))) A 
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In order to avoid unnecessary evaluations of f, which are expensive, this can 
be programmed as follows: 


sı = 1+sign(A3(0)) ; s2 = sign(As($)) — sign(As(0)) ; 
s3 = sign(ui/3) — sign(à3(4)) > S4= sign(A1(2)) — sign{u1/3) ; 
85 = sign(A;(1)) — sign(\1(2)) > s&s = 1 — sign{A(1)) ; 
Fiz =0; 
if sı # 0 then 
Fij = Fija + Łs1 fo; 
if s2 # 0 then 
Fija = Fija + $82 fon; 
if sz Æ 0 then 
Fija = Fij2 + $83 f1/3; 
if s4 Æ 0 then 
Fij2 = Fyj2 + 584 fz23; 
if s; # 0 then 
Fiyo = Fija + $55 fos; 
if sẹ # 0 then 
Fyj2 = Fiyo + $56f1; 


This may still result in some superfluous evaluations of f, since it can happen 
that some contributions cancel. To avoid this the various possibilities can be 
put in a logical table; see for instance Sect. 12.3.3 of Toro (1997). 


Hemker and Spekreijse (1986) have proposed a version of the Osher scheme 
in which the order of the subpaths is reversed. That is, Ip is parallel to 
Rp, p = 1,2,3. It turns out that this usually results in significantly fewer 
evaluations of the flux f(U), so that this version requires less computing 
time. 


It has been proven (Osher and Solomon (1982)) that the Osher scheme in the 
semi-discrete form (10.52) converges as h | 0 to a solution satisfying the en- 
tropy condition, under the assumption that the numerical solution converges. 
This proof does not hold for the version of Hemker and Spekreijse. 


Another nice property of the Osher scheme (in both versions) is that the 
flux F 1/2 is a differentiable function of Uo and U1, as shown by Hemker and 
Spekreijse (1986), who also give expressions for the derivatives concerned. 
This is beneficial for the convergence behavior of iterative solution meth- 
ods for implicit versions of the scheme. The Roe flux is not a differentiable 
function of the two neighbouring states. 
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Numerical experiments 


We apply the Osher scheme to the test cases of Sect. 10.3. The temporal 
discretization is given by (10.72). We start with the test case that is most de- 
manding with respect to the entropy condition, namely the Mach 3 test case. 
The original Osher scheme is called the O-variant; the version of Hemker and 
Spekreijse is called the H-variant. Fig. 10.11 gives results for the O-variant; 
results for the H-variant are found to be identical in this case. Although near 
the sonic point there is a slight discontinuity that looks like an expansion 
shock, the results are not in disagreement with entropy condition, because 
the entropy plot shows that the entropy of a fluid particle does not decrease. 
As shown by the lower half of the figure, the sonic glitch becomes smaller 
with mesh refinement. Fig. 10.12 gives results for the supersonic shock tube 
problem. Unless stated otherwise, h = 1/48. With the O-variant, there is 
almost no sonic glitch in the expansion zone. The H-variant shows a larger 
sonic glitch, but the entropy plot shows there is no violation of the entropy 
condition. 


Results for Sod’s shock tube problem are shown in Fig. 10.13. The O- and 
H-variants give good and almost indistinguishable results. For the test case 
of Lax (Fig. 10.14) results for the O- and H-variants are found to be indis- 
tinguishable. 


Summarizing, these tests show that the O- and H-variants of the Osher 
scheme give almost the same results, except near sonic points; it may hap- 
pen that the H-variant gives a larger sonic glitch than the O-variant. Both 
variants seem to satisfy the entropy condition, a fact that has been proven 
for the O-variant. 


In Table 10.1 the number of flux evaluations required to compute these test 
cases is listed. The H-variant is seen to bring significant savings. 


Mach 3 


Supersonic s.t. 


Sod 


Lax 





Table 10.1. Number of flux evaluations required for test case computations. 


Comparing with the results obtained with the Roe scheme in Sect. 10.3, we 
see that for the test cases of Sod and Lax (both without sonic points) the 
Osher scheme and the Roe scheme give almost the same results. When a 
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Fig. 10.11. Mach 3 test case, O-variant; t = 0.0875, À = 0.2, h = 1/48 (above), 
h = 1/96 (below); —: exact solution, o: numerical solution. 
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Fig. 10.12. Supersonic shock tube problem with O-variant (above) and 
H-variant (below); t = 0.15, A = 0.3. 
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Fig. 10.13. Sod’s shock tube problem with O-variant; t = 0.1667, A = 0.4. 
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Fig. 10.14. Test case of Lax with O-variant; t = 0.15, 4 = 0.2. 
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strong expansion fan is present, as in the Mach 3 test case, the Roe scheme 
suffers from violation of the entropy condition. 


10.5 Flux splitting schemes 


Flux splitting schemes can be regarded as a generalization of the Courant- 
Isaacson-Rees scheme (9.81) from the scalar to the systems case. We recall 
that in the scalar case the numerical flux is given by 


Fy41/2 = Fle) if F (pja) > 0, 
Fyaij2 = f(pj+1) if F (2341/2) <0, 


where p;41/2 = (p; + pj+1)/2. Steger and Warming (1981) have generalized 
this to systems in the following way. The Euler flux is split as follows: 


fU) = f+ (U) + fF (U) (10.83) 
such that 
A(ðft/3U)>0, A(Af~ /AV) <0, (10.84) 


i.e. the Jacobian of ft has only nonnegative eigenvalues and the Jacobian 
of f~ has only negative eigenvalues. Then we can write down the following 
generalization of the Courant-Isaacson-Rees numerical flux: 


Fyaaya = f+ (Us) + f~ (Uz4) - (10.85) 


The resulting scheme is obviously conservative and consistent (because 
(10.54) is satisfied), 


The van Leer scheme 


Various splittings of type (10.83), (10.84) are possible, and the performance 
of the scheme depends on the choice that is made. In order to obtain a good 
scheme, van Leer (1982) puts the following requirements on the splitting, in 
addition to (10.84): 


(i) f*(U) must depend continuously on U; 
(ii) ft (U) = f(U) for M > 1 and f- (U) = f(U) for M < -1; 
(iii) ft + fT must have the same symmetry properties with respect to M 
(keeping all other state variables constant) as f; 
(iv) The Jacobians ð f*/ðU must depend continuously on U; 
(v) Of* /AU must have one zero eigenvalue for |M] < 1; 
(vi) Like f, f* must be a polynomial in M, and of the lowest possible degree. 
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We will now clarify these requirements. We note that we can write 


pcoM 
F(U) = | pe? (M? + 1/7) , M=u/c. 
pce?M(5M* + 1/(7 - 1) 


Hence, keeping p and c constant, 
fija(M) = ~fi,3(—M), fe(M) = fe(—M). 


Requirement (iii) therefore becomes: 


fi3(M) + fi3(M) = —fi3(—M) + fi3(M) , 
ff (M) + fy (M) = f} (—M) + fz (—M). 


Requirements (i) and (iv) make the numerical solution smooth , especially 
near sonic points (M = 1) and stagnation points (u = 0), where eigenvalues 
change sign. Furthermore, these requirements enhance the convergence be- 
havior of iterative methods for the solution of implicit versions of the scheme. 
Requirement (ii) ensures that the numerical scheme has the same domain of 
dependence as the differential equation. Requirement (iii) makes the numer- 
ical flux share an important property with the exact flux. It turns out that 
requirement (v) enables the scheme to capture stationary shocks in two cells. 
Finally, requirement (vi) makes the splitting unique. 


From these requirements van Leer derives the following splitting: 


łpe(1 + M}? 
ft = | C+- 1)M) 
apa l f 
(10.86) 
—}pe(1 — M)? 
p= | ibo- m- 2) 
wea irr 


for |M| < 1; for |M| > 1 the splitting is given by requirement (ii). The only 
requirement that is not trivially satisfied is (10.84). It is left as an exercise 
to show that (10.84) is satisfied. 


The modification of Hanel, Schwane and Seider 


According to Bernoulli’s law, for stationary inviscid flow the total enthalpy 
satisfies H = h+ tu? = constant along streamlines. If the streamlines emanate 
from a region of constant H, then H = constant everywhere. We can also see 
immediately from the stationary mass and energy equations 
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divpu=0, divpuH =0 


that H = constant is a solution. In general, for numerical schemes H = 
constant is not an exact solution, but gives a residual of discretization error 
size. Hanel, Schwane, and Seider (1987) report that this may cause significant 
errors in regions of strong Mach number changes. They propose to modify 
van Leer’s flux splitting for the energy equation by: 


puH = fi H+ fra, (10.87) 


with fı = fi + fy the van Leer flux splitting for the mass conservation 
equation. That the flux splitting (10.87) has the desired effect can be seen as 
follows. In the stationary case the discretized energy equation becomes with 
the splitting (10.87): 


ft (Uj-1)Hj-1 + {f (U;) — JF (U;)}H; ~ fr Ui41)Hj41 = 0. (10.88) 
The stationary discrete mass conservation equation gives 
ft (Uj-1) + fr (Uj) — FRU) - fF Uja) = 0, 
so that H; = constant is indeed a solution of (10.88). 


It turns out that of the six requirements that determine van Leer’s flux split- 
ting, the modification of Hanel et al. violates only requirement (v), the pur- 
pose of which is {as stated before) to ensure capturing of stationary shocks 
in two cells. Fig. 10.15 shows an example with a slowly moving weak shock. 
We see that the modified scheme indeed needs a few more cells to capture 
the shock. But it turns out that in most cases the shock resolution of the 
modified scheme is about as crisp as that of the original scheme. Results ob- 
tained with the modified scheme for the four test cases discussed before will 
not be shown, because they resemble closely the results obtained with the 
original scheme, except that the modified scheme gives a somewhat smaller 
sonic glitch for the supersonic shock tube problem. 


The Osher scheme allows H; = constant in the stationary case. As for the 
scheme of Hänel et al., this follows from the fact that the energy flux equals 
H times the mass flux. In the stationary case the scheme gives 


Fy41/2 — Fj—1/2 =0. (10.89) 


For the mass conservation equation this gives with the Osher scheme an 
expression of the type 


X am, =0, m=pu, (10.90) 


where s indicates the various states occurring along the integration paths 
used to determine Fj41/2, and a, are coefficients that need not be specified. 
The Osher scheme gives for the stationary energy equation 
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Fig. 10.15. Van Leer scheme (above) and of Hänel c.s. (below); pz = 1, 
pr=0.9, pL = 1, pr = 0.9275, uz = —1, ur = —1.0781, t = 0.175, A= 0.4. 


440 10. The Euler equations in one space dimension 


Samm, H, =0. (10.91) 


Comparison of (10.90) and (10.91) shows that H, = constant is a solution. 


The Roe scheme, however, does not in general allow H; = constant in the 
stationary case. This can be shown as follows. It suffices to consider the 
special case where 


Uj4i ~ U; = (a R1)j41/2 - 


with aj41/2 an arbitrary coefficient. Then (10.66) and (10.89) give for the 
mass and energy conservation equations 


Mj41 — Mj_1 — (al — S|) 5412 + (alu — El)j-12 = 0, 
(mH)j41 — (MH)j-1 ~ (alt — ë|(H — üë))j+1/2 + 
(ajū — e|(H — tic));-1;2 =0, 


and it is clear that H; = H; 1/2 = constant is not a solution. 


Resolution of stationary contact discontinuities 


A significant defect of the van Leer scheme is the fact that stationary contact 
discontinuities cannot be resolved, as noted in van Leer (1982). This we now 
show. Suppose the flow is stationary, and that there is a stationary contact 
discontinuity at £;41/2. Hence 


uk =0, Pk = Pky, Vk; 
, , 10.92 
Prk = pp, kSj; Pk = Pii, k>j+l. l ) 


The van Leer scheme gives, using the fact that (pc?);41 = (pc?°);, 


Ww, 1 q1(pC)j+1 — 2(pc); + (pe) ;-1} 
=i => a 0 #0, (10.93) 
BETA (Cj41 — Bey + cj-1) 

so that the numerical solution will not be stationary, but the contact discon- 
tinutity will be approximated by a smooth profile that widens diffusively as 
time progresses. This is illustrated in Fig. 10.16. The Osher scheme does not 
have this defect, as seen in Fig. 10.16. It is left as an exercise to show that 
for a stationary contact discontinuity the Osher flux is zero, like the exact 
Euler flux. Hence the scheme gives dU;/dt = 0. The Roe flux is also zero; 
this is also left as an exercise. We will show in Chap. 12 that this smearing of 
stationary contact discontinuities makes the van Leer scheme unsuitable for 
the discretization of the inviscid terms in viscous flow computations. Further 
numerical results will be presented only for the improved version below. 
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Qa 1 





Fig. 10.16. Stationary contact discontinuity at t = 0.1 and ¢ = 1. Left: van Leer 
scheme; right: Osher scheme. A = 0.4, pr = pr = 0.5, uL = ur = 0, 
PL = 1, PR = 0.6. 


The AUSM scheme 


In Liou and Steffen (1993) a flux splitting scheme is proposed that is similar 
to the van Leer scheme, but that has a satisfactory resolution of stationary 
contact discontinuities. This scheme is often called the AUSM (advection 
upstream splitting method} scheme, and represents the state of the art of 
flux splitting schemes. It rivals the accuracy of the Osher and Roe schemes, 
but requires less computing. 


The AUSM scheme is defined as follows: 
1 ~ 1 ~ ~ ~ 
Fyaaya = 5(M + |M])j+1/2U; + 5M ~ IMI) j41/2U544 + Pt + Pry, 
M;j+1/2 = M} + Mj} » 


1 
M* = £7 (M+1)?, M| <1; M*=5(M-+[M)), |M|>1, 
; 1\ | 0 
U=pe| u |, Pe =([ pt], 
H 0 


p* = 5(14M), M| <1; p* = Ê(1 + |M|/M) , M| >1. 


Note that this scheme is not quite of the form (10.85). 


Suppose we have a stationary contact discontinuity at £;+1/2, with Up given 
by (10.92). It is left to the reader to show that in this case 


dU;, 
a 0, Vk, 

so that a stationary contact discontinuity is resolved exactly by the AUSM 
scheme. 
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It is left to the reader to show that the AUSM scheme allows H; = constant 
in the stationary case. 


Numerical experiments 


We discretize in time according to (10.72). Figs. 10.17-10.20 show results 
obtained with the AUSM scheme for the four test cases considered before. 
The results are at least as accurate as those obtained with the Osher and 
Roe schemes, and in fact better where the sonic glitch is concerned. 


Exercise 10.5.1. Show that van Leer’s flux splitting (10.86) satisfies condi- 
tion (10.84). 
Hint: one eigenvalue is zero. Show that the other two satisfiy for |M| < 1: 


X aq + M)[1 + P - M)’ — 24(1 — M) ~ 2(y + 3)}] 
e? 1—M 


+ (1+ M)*[l - {47(7 - 1)(1-M) + (y+ D(y—-3)}] = 0 


8y(7 4+ 1) 
the roots of which are positive for 1 < y < 3. 
Exercise 10.5.2. Derive equation (10.93). 


Exercise 10.5.3. Show that the AUSM scheme allows H; = constant in the 
stationary case. 


Exercise 10.5.4. Show that for the AUSM scheme we have dU;/dt = 0 in 
the case of a stationary contact discontinuity. 
Hint: show that Mj41/2 = 0. 


Exercise 10.5.5. Show that for the Osher scheme we have dU, /dt = 0 in 
the case of a stationary contact discontinuity. 

Hint: show that (10.82) gives u1;3 = 0. Since up = u, = 0, all contributions 
to the Osher flux are zero. 


Exercise 10.5.6. Show that for the Roe scheme we have dU; /dt = 0 in the 
case of a stationary contact discontinuity. 
Hint: show that a; = a3 = A = Q. 


10.6 Numerical stability 


Rigorous stability analysis of numerical schemes for nonlinear hyperbolic sys- 
tems is difficult, and has rarely been attempted. With von Neumann stability 
analysis, we linearize, freeze the coefficients and study the growth or decay 
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DENSITY VELOCITY PRESSURE 





MACHNUMBER ENTROPY Liou-Steffen scheme 





0 0.5 4 


Fig. 10.17. Sod’s shock tube problem; -~: exact solution; o: numerical 
solution; t = 0.168, A = 0.35. 
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MACHNUMBER ENTROPY Liou-Steffen scheme 





0 0.5 1 
Fig. 10.18. Test case of Lax: t = 0.146, à = 0.25. 
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DENSITY VELOCITY PRESSURE 





MACHNUMBER ENTROPY Liou-Stetfen scheme 





0 
0 0.5 1 
Fig. 10.19. Mach 3 test case; t = 0.0875, à = 0.2. 


DENSITY VELOCITY PRESSURE 





Liou-Steffen scheme 





D a 05 = 0 05 1 
Fig. 10.20. Supersonic shock tube problem; t = 0.14, À = 0.11. 
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of the solution by means of Fourier analysis. As an example, consider the 
explicit Euler method for a flux splitting scheme. The numerical flux is given 
by (10.85), and we get 


UPH — UF + M Ft (UP) — fF (URL) +f OR) -— fo (UP) = 0. 
Linearization and freezing of the coefficients gives 


UP! — UP + AFT (UP — UF) + AFT (UR — UP) = 0, 


i 


where F* are the Jacobians of f +, assumed constant. Assuming an harmonic 
wave U? = Ure? gives U"+1 = GU", with the amplification matrix G given 
by 


G = I= AFt(1-— e7) — AFT (e — 1) 
= I~ A(F+t — F~)(1—cos6) — iàf sinð , 


where we have used Ft + F7 = f, with f the Jacobian of f (cf. (10.83)). 
We have stability if the eigenvalues p of G satisfy 


le(G)| <1, Ve. 


However, because the eigenvectors of Ft — F~ are in general different from 
those of f , G is not easily diagonalizable, if at all, and u(G) is hard to deter- 
mine. Furthermore, freezing the coefficients is unrealistic where the solution 
is not smooth, for instance near shocks. 


For simplicity we proceed in an even less rigorous manner, and linearize the 
system of differential equations instead of the numerical scheme, and freeze 
the coefficients. This leads us to the following system: 


aU | OU _ 


a tf Oz 0, 


with the Jacobian f assumed constant. Let R be the matrix with columns 
Ry, Re, R3, given by (10.21). Diagonalization gives 

Ov OV 

—+A—=0, V=R UU, 

ot t ðr U 
with A a diagonal matrix containing the eigenvalues u = u, utc of f . This 
is an uncoupled system of scalar equations of the type 


ðv 4 Ov 
at ar 
We now require that the scheme used for the Euler equations is stable when 
applied to (10.94). The schemes discussed before are generalizations of the 


=(. (10.94) 
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first order upwind scheme to the systems case. Application of the first order 
upwind scheme to (10.94) gives, assuming 4 > 0, 


dv; H 
at Ae 


i vj—1) = 0 . (10.95) 


Von Neumann stability analysis for temporal discretizations of (10.95) is 
easily carried out following the principles explained in Sect. 5.6. The symbol 
of the scheme is given by 


A 


La (0) = =(1 —e), oa=yur/h. 


We have stability if —7Z,, is in the stability domain S of the time stepping 
method to be used. Stability is required for all values p that occur in the flow 
domain. This means that we have to take 


p= p = max{|ul+c}. 
The corresponding value of c, i.e. 
o=pr/h 
is usually called the CFL (Courant-Friedrichs-Lewy) number. 
For the forward Euler method we have stability if 
a<l. (10.96) 


From the exact solutions, we obtain for the four test cases considered in the 
preceding section the approximate estimates for j listed in Table 10.2, which 
also gives the stability bounds À, for \ = r/h that follow from (10.96). The 


Lax 


Mach 3 
Supers. shock tube 





Table 10.2. Estimates for js and stability bounds for A. 


numerical results in the preceding sections were obtained with values of À 
close to A4. 


Exercise 10.6.1. Derive equation (10.96). 
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10.7 The Jameson-Schmidt-Turkel scheme 


Second order central discretization gives the following numerical flux: 


Fyasya = F(U) + F(U;)} 


This scheme is unusable because it gives serious oscillations near disconti- 
nuities, and may also generate oscillations elsewhere. The schemes of Roe, 
Osher and van Leer have a numerical flux that can be written as 


1 
Fj41/2 = 5 (f(Uj) + f(Uj41)} + extra term . 


The extra term is given explicitly for the Roe scheme in (10.66) and for the 
Osher scheme in (10.76). The extra term provides just enough dissipation 
to obliterate numerical wiggles, while still maintaining crisp resolution of 
discontinuities; moving contact discontinuities are smeared, however. In the 
schemes of Roe, Osher and in flux splitting schemes the extra term is an 
implicit consequnce of the way in which the Riemann problem at the cell face 
is approximated. Instead, one may try to design the extra term explicitly. This 
is the approach of the Jameson-Schmidt-Turkel or JST scheme, developed by 
Jameson, Schmidt, and Turkel (1981), Jameson (1985b), Jameson (1985a), 
Jameson (1988). Of course, the extra term has to be designed judiciously. The 
aim is to obtain a scheme with greater simplicity and economy of computation 
than the schemes described before, allowing if necessary a slight increase of 
numerical shock thickness. 


Artificial viscosity 


For the scalar case, the JST scheme has been presented in Sect. 9.4. In Jame- 
son (1985a) the following numerical flux is proposed for the Euler equations: 


1 
Fj41/2 = 51 F(U5) + f(Uj41)} — dj41/2, 


where dj 41/2 is an artificial viscosity term. Writing d = (d’, d?, d°)7, for the 
mass conservation equation d! is chosen as follows: 


2 
dj 44/2 = ria/2{Ejpr (Pitt — pj) - EO jalp? ~ 3pj341 + 3p; — pj-1)} - 
(10.97) 


Here 7;41/2 is a coefficient chosen to give the artificial viscosity term the 
proper scale. On dimensional grounds, it must have the dimension of velocity, 
and is taken to be an estimate of the special radius of the Jacobian 0f/0U 
at the cell face: 
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1 
Tj+1/2 = 5 (lua +e; + |ui] + cj+1)- 


The term with e(?) represents artificial diffusion proportional to the second 
derivative of p. Its purpose is to damp oscillations near discontinuities. This 
term is turned off adaptively in regions where the solution is smooth; there the 
term with (4) takes over. This term is proportional to the fourth derivative of 
p. A sensor that signals the presence of a shock is needed. This shock sensor 
is chosen as follows: 

y, = (Piti — °P; + Pi-1 

7 [P+ + 2P; + pj- 

We set 


Dy 41j2 = MaX{Vj +2, Vj41, Vj, Vj-1} 
Then we take 


2 . Ll _ 
ae = min{ 5,5; 41/2} 


e(4) = max{0, kí) — obj 41;9} ; 


E 


where k), k(4) and a are constants, typically chosen as follows: 
kO =1, k® =1/32, a=2. (10.98) 


We see that near shocks, where ¥;41/2 becomes of order one, el, /2 is turned 
off; this is necessary because the fourth order dissipation term is found to 
produce overshoots near shocks. This completes the spatial discretization of 
the mass conservation equation. In regions where the solution is smooth we 
have 1341/2 < 1, so that eu is switched of. Hence Tia 1ja—4j_ay2 = O(h4), 
giving the scheme the order of accuracy of a second order central scheme, 
namely O(h*). Near shocks the solution is not differentiable, and it makes 
little sense to estimate the local truncation error with Taylor expansion. 


The spatial discretization of the momentum equation is obtained by replacing 
p by m = pu in (10.97): 
2 
dF aye = riade iMi —m;) 
4 
— el") j2(mja2 — 3mj41 + 3m; — mj-1)} , 

where the coefficients are defined as before. The spatial discretization of the 
energy equation is obtained by replacing p not by pE, as one might expect, 
but by pH. This is done to make H = constant a solution of the stationary 
energy equation, making the scheme conform to Bernoulli’s law. So we obtain 

2 

rs aye =r; (Ef) PH) = (pH);) = 
€541/2((PH)j42 — 3(PH)j+1 + 3(pH); — (pH);-1)} - 
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Temporal discretization 


Since in regions where the solution is smooth the JST scheme is virtually 
identical to the second order central scheme, the explicit Euler scheme cannot 
be used for time stepping. In the linearized scalar case, the Fourier symbol of 
the spatial discretization is (almost) purely imaginary. Therefore a temporal 
discretization method is needed with a stability domain that includes part of 
the imaginary axis. Certain Runge-Kutta methods are suitable. The system 
of ordinary differential equations that results after spatial discretization can 
be written as 


where the algebraic vector V(t) contains all unknowns. Consider the following 
explicit four-stage Runge-Kutta method: 


vO = yr, 
VY) = yO _ airR(V 


(vy) 

Vi) = YO _ a rR(ve 
(ve 
( 


) | (10.99) 
), 


Il 


) 

) 

VE) = VO _ asrR(v 
yrtt = yO _ RV), 
The coefficients are chosed to maximize the stability interval along the imag- 
inary axis. Temporal accuracy is given low priority, because the JST scheme 
is designed primarily for stationary flows. The general problem of finding the 
coefficients that define Runge-Kutta methods such that for a given tempo- 
ral accuracy the stability interval along the imaginary axis is maximized has 
been considered by van der Houwen (1977). For temporal accuracy of order 
one, Sonneveld and van Leer (1985) give a solution, namely 


ay=1/3, a, = 4/15, a3 = 5/9. (10.100) 


The classical four-stage Runge-Kutta method (maximizing order of accuracy) 
has 


a,=1/4, a = 1/3, œz = 1/2. (10.101) 


The stability domains of these two methods are presented in Fig. 10.22. We 
see that if the symbol of the (linearized) spatial discretization is purely imag- 
inary, then the version of Sonneveld and van Leer allows a time step that is 
5% larger than allowed by the classical version. Hence, this sacrifice of accu- 
racy for stability hardly pays off. 


A substantial saving in computing time can be realized by evaluating the arti- 
ficial dissipation term only once. This can be done by writing 
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Fig. 10.21. Stability domains of four-stage Runge-Kutta method with coefficients 
given by (10.100) (—) and (10.101) (- - -). 


R(V) = Q(V) — D(V) where D arises from the artificial dissipation term, 
and by defining in (10.99): 


RV) = QV) — D(VO) . 


We have applied the JST scheme to the test cases considered before. The 
coefficients are chosen as proposed by Jameson (1985a), and are given by 
(10.98) and (10.100). We will also present results with a larger value of k), 
in order to improve results near shocks. The step size is h = 1/48 in all cases, 
The time step is chosen close to the stability limit on an empirical basis. The 
upper half of Fig. 10.22 shows shock-induced wiggles. As shown in the lower 
half of the figure, these disappear if the second order artificial viscosity is 
increased sufficiently (k(?) = 8); the time step has to be decreased in order 
to maintain stability. We see that shock capturing is less crisp than with the 
schemes discussed earlier, but still the results of the lower half of the figure 
are good, although there is a slight violation of the entropy condition . 


Fig. 10.23 also shows wiggles for k?) = 1. Here k? = 6 was found to be 
sufficient to obtain smooth results, but in order to see whether good results 
can be obtained with the parameters fixed for all problems, we have used 
k(2) = 8 for the lower half of Fig. 10.23, as in Fig. 10.22 (it turns out there is 
very little difference between results with k(?) = 6 and k(?) = 8 in this case). 
Again, the shock is somewhat more smeared than with the schemes discussed 
earlier, but nevertheless the accuracy of the results with k!?) = 8 is good. 


Fig. 10.24 shows that the JST scheme does not give a sonic glitch (unlike the 
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DENSITY VELOCITY PRESSURE 





MACHNUMBER JST scheme 


lambda = 0.6 
k2 = 1 
k4 = 0.03125 





VELOCITY PRESSURE 





JST scheme 


MACHNUMBER 
lambda = 0.4 
k= 8 


k4 = 0.03125 





Fig. 10.22. Sod’s shock tube problem at t = 0.1667. Above: A = 0.6, ki?) = 4 
below: A = 0.4, kb) = 8. 
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ENTROPY JST scheme 
lambda = 0.2 
k? = 1 
k4 = 0.03125 
DENSITY VELOCITY PRESSURE 
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ENTROPY JST scheme 
lambda = 0.2 
k2=8 
k4 = 0.03125 





Fig. 10.23. Test case of Lax at t = 0.15. Above: À = 0.2, k?) = 1; below: À = 0.2, 
k = 8. 
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Fig. 10.24. Mach 3 test case at t = 0.0875. Above: A = 0.2, k? = 1: 
below: A= 0.15, k? = 8. 


454 10. The Euler equations in one space dimension 


DENSITY VELOCITY PRESSURE 






JST scheme 
lambda = 0.3 
k2=1 
k4 = 0.03125 





DENSITY VELOCITY PRESSURE 





JST scheme 
lambda = 0.3 
k2=8 
k4 = 0.03125 





Fig. 10.25. Supersonic shock tube problem at t = 0.15. Above: à = 0.3, k? =1: 
below: A= 0.3, kM) =8. 
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DENSITY DENSITY 





Fig. 10.26. Stationary contact discontinuity at t = 1 with A = 2. Left: k“) = 1/32; 
right : k® = 0. 


schemes discussed before), despite the fact that there is a slight violation of 
the entropy condition near the sonic point (z = 0.5; note that fluid particles 
cross this point from left to right). The small velocity wiggles can be made 
to disappear by increasing k'?) to k?) = 4, but again we show results for 
k?) = 8, for which it is found to be necessary to decrease the time step. 
The difference between results for k?) = 4 and k(?) = 8 was found to be 
negligible, but compared to k(?) = 1, Fig. 10.24 shows a slight decrease in 
accuracy, apart from the removal of wiggles, of course. Nevertheless, the re- 
sults compare well with those obtained with the Osher and van Leer schemes 
in this case. 


As seen in Fig. 10.25, also for the supersonic shock tube no sonic glitch oc- 
curs, although again these is a slight violation of the entropy condition near 
the sonic point. The wiggles in the velocity in the upper half in the figure 
dissappear when k'?) is increased to k{*) = 4 (not shown); almost the same 
results are obtained with k(?) = 8. Results for a stationary contact discon- 
tinuity (same data as for Fig. 10.16) are shown in Fig. 10.26. There is some 
smearing, but the numerical width of the contact discountinuity does not in- 
crease with time, as for the van Leer scheme. Since the pressure is constant, 
the second order artificial viscosity is zero. The small overshoot that occurs 
is caused by the fourth order artificial viscosity; when it is turned off the 
result of the scheme is exact. But for a moving contact discontinuity, with a 
vanishing artificial viscosity unacceptable wiggles occur (results not shown). 
Hence, the fourth order artificial viscosity has to be turned on, leading to the 
nonmonotone approximation shown in the left half of Fig. 10.26. This lack 
of monotonicity corresponds to the fact that the JST scheme is not TVD, as 
shown for the scalar case in Sect. 9.4. In Jameson (1988), Kim and Jameson 
(1995) a fourth order artificial dissipation term is presented that makes the 
scheme TVD; this TVD version was described in Sect. 9.4 for the scalar case, 
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and will not be further considered here. 


Summarizing, the results obtained with the JST scheme are of the same 
quality as those obtained with the schemes of Roe and Osher and the flux 
splitting schemes. Although slight violations of the entropy condition occur, 
no sonic glitches are present. Shock resolution is less crisp. It is necessary to 
choose the parameters governing the artifical dissipation right. The parame- 
ters given by Jameson (1985a) and listed in (10.98) are not adequate for the 
one-dimensional problems treated here; k(?} should be changed to k?) = 8. 


Exercise 10.7.1. Show that in the stationary case the JST scheme allows 
the solution H; = constant. 


10.8 Higher order schemes 


The Roe, Osher and flux splitting schemes are generalizations of first order 
upwind schemes from the scalar case to the case of systems. As a consequence, 
these schemes are first order accurate in space. In order to improve accuracy 
and resolution of contact discontinuities, it is necessary to make these schemes 
second or higher order accurate in regions where the solution is smooth, 
while avoiding spurious wiggles. In Sect. 9.4 we discussed a good way to do 
this for the scalar case, namely the MUSCL (monotone upwind schemes for 
conservation laws) approach proposed in van Leer (1979), van Leer (1984). 
This approach generalizes easily to the case of systems. 


The MUSCL approach 


Suppose we have a first order spatial discretization, which leads to the fol- 
lowing semi-discrete system: 


dU; 1 
-y tgu- Fi-12) = 0, Fy4ij2 = F(U;, Uj41). 


Just as in the scalar case discussed in Sect. 9.4, the accuracy may be improved 
by inserting extrapolated left and right states in the numerical flux function: 


Fy41/2 = F(Uf 472, Uj41/2) ° 


To avoid spurious wiggles, the slope limited extrapolation (9.98) is applied 
to each of the state variables separately. For instance, 


l 1 ] 
Py41/2 =p; + 5M (rs) (e541 ~ pj) Pj41/2 = Pj+1 + zele — Pj+1) » 


r+ 
rj — Pi Pi- 
Pi+1 — Pj 


10.8. Higher order schemes 457 


The limiter y(r) is chosen such that where the solution is smooth a higher 
order scheme, for instance the «-scheme with a specified value of «x, is emu- 
lated. 


Numerical stability 


In order to derive stability conditions, we use the same heuristic reasoning as 
in Sect. 10.6, and require the scheme to be stable for the scalar conservation 
law 


ap afle) 
Ot Or 


where we do not specify f(y), except that the wave speed df/dy is assumed 
to be equal to the largest wave speed for the hyperbolic system under con- 
sideration. For the Euler equations this leads to 

df 


dp = lul+e. (10.103) 





=0, (10.102) 


In Sect. 9.4 conditions were found for schemes for (10.102) to be TVD and 
hence stable. Although the TVD concept does not carry over to systems, it 
is found in practice that a scheme that is TVD for the scalar case is usually 
free from spurious oscillations in the systems case. 


As an example, we choose the van Albada limiter (van Albada, van Leer, and 
Roberts (1982)): 


Wr) =(r?+r\/l+r?),r>0; v(r)=0,r<0. 


Since % (1) = 1/2, this gives us the x = 1/2 scheme in smooth parts of the 
flow. Simple analysis shows that we have 


wn) < (1+ V2)/2 = 0.85, 


0 < ¥(r) < (1+ V2)/2. 


The graph of {r} is shown in Fig. 10.27, together with the lines y(r) = 
r(1 + J2)/2 and y(r} = r. It is clear that the graph of the van Albada 
limiter is in the admissibility region of Fig. 4.12. 


0< 
= (10.104) 


It was shown in Sect. 9.4 that the explicit Euler scheme is TVD if the graph 
of u(r) is in the admissibility region of Fig. 4.12, which is the case, and if the 
time step is small enough (cf. (9.107)): 


BS GM M>1+max{y(r)/r: r>0}, 


a =sup(lul+c). (10.105) 
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Fig. 10.27. Graph of van Albada limiter. 


It follows from (10.104) that a suitable value of the constant M (not to be 
confused with the Mach number) is 


M = (34 ¥2)/2 = 1.85. 


It was shown in Sect. 9.4 that the Runge-Kutta schemes of Heun (9.111) and 
of Shu and Osher (9.112) are TVD under the same conditions as the explicit 
Euler scheme. The bounds A; on A that follow from (10.105) for the four test 
problems discussed before are listed in Table 10.3. We see that these bounds 
are much more strict than those of Table 10.2. 


Sod 
Lax 


Mach 3 
Supers. shock tube 





Table 10.3. Estimates for ji and TVD bounds à; for 4 for the explicit Euler scheme 
with the MUSCL method using the van Albada limiter. 


If the time stepping scheme is not TVD, or if one wishes to increase the 
time step, the TVD requirement may be replaced by the requirement of 
stability, which is often weaker. A heuristic stability criterion is obtained 
for MUSCL schemes as follows. We look for stability of the scheme for the 
scalar advection equation (10.94). In smooth parts of the flow the scheme 
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switches to the underlying «-scheme, otherwise the scheme switches to the 
first order upwind scheme. We therefore require the time stepping scheme to 
be stable for both schemes. The symbol of the first order upwind scheme is 
given by (10.94). The symbol of the x-scheme applied to (10.94) is given by 
(cf. Sect. 5.6): 


£n(8) = =(n(0) +i), nl) = 2o(1—)5?, 
; (10.106) 
72(0) =o{(1—«)s+1}sin@, s= sin? 59: o=pr/fh. 


Since the locus of —rL,,(0) as given by (10.106) is close to the imaginary 
axis, it is desirable to use a time stepping scheme with a stability domain 
that contains part of the imaginary axis. To see what happens if we use a time 
stepping scheme that is not (known to be) TVD, we select the SHK Runge- 
Kutta method described in Sect. 5.8. The stability domain S is depicted in 
Fig. 5.9. The intersections with the real and imaginary axes are at —2.7893 
and i/8. For the first order upwind scheme, the locus of —rL,(0) is a circle 
with center at —o and radius o = ji; this circle is inside S if o < 1.3926, 
which leads to the following stability condition: 


A < 1.3926/j. (10.107) 


Using Theorem 5.7.4, we find that for the x-scheme —rL,,(9) is inside the 
ellipse of Fig. 5.9 if 


with a = 2.7853 and b = 2.55. This gives, with x = 1/2, 
Aà <à = 1.2/p. (10.108) 


It is left to the reader to show that the use of the rectangle of Fig. 5.9 
leads to a more restrictive stability condition. By comparision of (10.107) 
and (10.108) we see that both the first order upwind and the x = 1/2 scheme 
are stable if (10.108) is satisfied. The resulting values of A, for the four test 
cases are listed in Table 10.4. These bounds were indeed found to be sufficient 
in the numerical tests to be described next. Note that the stability bounds 
of Table 10.4 are much weaker than the TVD bounds of Table 10.3, but 
the TVD property is not guaranteed. Nevertheless, spurious oscillations were 
found to be absent in the numerical experiments presented below. 


Numerical experiments 


In all cases, the spatial step size satisfies h = 1/48. Figs. 10.28-10.31 show re- 
sults for the MUSCL version of the Roe scheme, without a sonic entropy fix. 
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Fig. 10.28. Sod’s test problem with Roe scheme; t = 0.15, \ = 0.6. 
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Fig. 10.29. Test case of Lax with Roe scheme; t = 0.144, à = 0.3. 
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Fig. 10.30. Mach 3 test case with Roe scheme; t = 0.0875, à = 0.3. 
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lambda = 0.4 t = 0.15833 
Primitive extrapolation 
SHK Runge-Kutta 
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epsilon = 0 


Fig. 10.31. Supersonic shock tube problem with Roe scheme; t = 0.158, 
A = 0.4. 
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Osher scheme, O—variant 


lambda = 0.5 t = 0.16667 
Primitive extrapolation 
SHK Runge-Kutta 
MUSCL, van Albada limiter 
flopcount = 680081 


0 05 : “0 0.5 
Fig. 10.32. Sod’s test problem with Osher scheme; t = 0.167, à = 0.5 
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ENTROPY Osher scheme, O-variant 


lambda = 0.25 t= 0.14583 
Primitive extrapolation 
SHK Runge-Kutta 
MUSCL, van Albada limiter 
flopcount = 1190619 
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Fig. 10.33. Test case of Lax with Osher scheme; t = 0.146, A = 0.25. 
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Osher scheme, O-varlant 


lambda = 0.3 t = 0.0875 
Primitive extrapolation 
SHK Runge-Kutta 
MUSCL, van Albada limiter 
flopcount = 535157 
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Fig. 10.34. Mach 3 test case with Osher scheme; t = 0.0875, A = 0.3. 
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Fig. 10.35. Supersonic shock tube problem with Osher scheme; t = 0.15; 
A = 0.4. 


464 10. The Euler equations in one space dimension 


DENSITY VELOCITY PRESSURE 





MAGHNUMBER ENTROPY Liou-Steften scheme 


lambda = 0.5 t = 0.16667 
Primitive extrapolation 
SHK Runge-Kutta 
MUSCL, van Albada limiter 
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Fig. 10.36. Sod’s test problem with Liou-Steffen scheme; t = 0.167, 
A = 0.5 

DENSITY VELOCITY PRESSURE 






Liou-Stetten scheme 


lambda = 0.2 t=0.15 
Primitive extrapolation 
SHK Runge-Kutta 
MUSCL, van Albada limiter 
flopcount = 1079276 





1 0 0.5 
Fig. 10.37. Test case of Lax with Liou-Steffen scheme; t = 0.15, À = 0.2. 
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lambda = 0.3 t = 0.0875 
Primitive extrapolation 
SHK Runge-Kutta 
MUSCL, van Albada limiter 
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Fig. 10.38. Mach 3 test case with Liou-Steffen scheme; t = 0.0875, A = 0.3. 
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Primitive extrapolation 
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Fig. 10.39. Supersonic shock tube problem with Liou-Steffen scheme; 
t = 0.15, A = 0.3. 
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Lax 
Mach 3 
Supers.shock tube 





Table 10.4. Estimates for ji and stability bounds 4; for À for the SHK Runge- 
Kutta method with the MUSCL method using the van Albada limiter. 


Comparing with the corresponding figures in Sect. 10.3, we see a better res- 
olution of contact discontinuities, and the sonic glitch has almost disapeared 
for the supersonic shock tube problem. The violation of the entropy condition 
for the Mach 3 test case is still unacceptable, but has become somewhat less 
severe. With MUSCL, the sonic entropy fix of Harten needs to be less drastic 
than when MUSCL is not applied; with ¢ = 0.2 we obtain better results (not 
shown) than in Fig. 10.10. 


Figs. 10.32-10.35 show results for the MUSCL version of the Osher scheme 
(O-variant). Compared to the results obtained in Sect. 10.4, the accuracy has 
improved and the sonic glitches have almost disappeared. 


Figs. 10.36-10.39 show results for the MUSCL version of the Liou-Stefien 
(AUSM) scheme. The accuracy improvement is similar to that of the other 
schemes. Sonic glitches disappear. 


Summarizing, the accuracy of the three schemes improves by application of 
MUSCL. The Liou-Steffen scheme has the best accuracy and is the cheapest 
of the three. The Roe scheme is as accurate as the Osher scheme, except 
when a strong expansion fan is present, as in the Mach 3 test case; this can 
be cured by a sonic entropy fix. The price we pay here for MUSCL is a more 
expensive time stepping scheme; because the CFL number (u+c)7r/h cannot 
be much larger than for explicit Euler time stepping with the non-MUSCL 
schemes, and because each Runge-Kutta step consists of four Euler steps, the 
amount of computing almost quadruples. 


Exercise 10.8.1. Use the rectangle of Fig. 5.9 to obtain the following suffi- 
cient stability condition for the x-scheme : 


a<1(2— 2k), 


where g is the CFL number. 


11. Discretization in general domains 


11.1 Introduction 


In the foregoing chapters we have restricted ourselves to Cartesian compu- 
tational grids, in order to avoid unnecessary technicalities in discussing the 
basic principles of computational fluid dynamics. In the present and in the 
following two chapters, we will go into the complications brought about by 
geometric complexity of the domain in which the flow takes place. These 
complications are mainly of a technical nature, and no new basic principles 
will emerge. Nevertheless, for successful computation of flows in general do- 
mains, the technicalities involved in computation on general grids must be 
considered carefully. 


We will use boundary-fitted coordinates for creating grids in general domains. 
Simple and exact expressions will be derived for cell volumes and cell face ar- 
eas. Tensor analysis is a valuable tool for expressing physical laws in general 
coordinates. Therefore we include an introduction to tensor analysis for the 
uninitiated reader. In the following two chapters, discretization of the gov- 
erning equations of fluid dynamics on general (structured) grids will be pre- 
sented, both for the staggered and the colocated placement of the dependent 
variables. We will see how the basic principles established in earlier chapters 
using simple grids can be used for developing discretizations on general grids. 
The issue of how to deal with rough (i.e. strongly non-uniform) grids and the 
effect of grid roughness on accuracy, already discussed in sections 3.7 and 4.7, 
will come up again, this time in a more general context. 


11.2 Three types of grid 


One of the distinguishing features of practical computational fluid dynamics is 
the geometric complexity of the domains in which flows take place. Examples 
are flows around aircraft, in turbines, in estuaries, and physiological flows, 
such as in hearts and arteries. There are many ways to discretize in general 
domains. We will give a brief survey of the various possibilities, and go into 
more detail about structured grids. 
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Discretization always entails subdivision of the domain in small cells, or, 
which amounts to the same thing, distribution of a finite set of points in the 
domain. The resulting set of cells or points is called the (computational) grid. 
There are basically three types of grid: 


— Cartesian 
— Structured boundary-fitted 
— Unstructured 


These three types are illustrated in Fig. 11.1. In a Cartesian grid the cells 
are rectangular. A grid is called structured if all interior cell vertices belong 
to the same number of cells. Hence, grids (a) and (b) of Fig.11.1 are both 
structured. 


If the boundary consists of cell faces the grid is called boundary-fitted. Un- 
structured grids are invariably boundary fitted. In Cartesian grids the domain 
boundary usually does not coincide with cell faces, which makes accurate im- 
plementation of boundary conditions complicated. However, in applications 
where the precise location of the boundary is unknown or of little impor- 
tance, the boundary may be approximated by Cartesian cell faces. Fig. 11.2 
gives an example. This example concerns the tidal flow in an estuary with 


(b) 


(a) (c) 


Fig. 11.1. Three types of grid: (a) Cartesian; (b) structured boundary-fitted; (c) 
unstructured. 


tidal flats. In this case the boundary between land and water is vague, and 
no harm is done by using the ‘staircase’ approximation for the land-water in- 
terface shown in the figure. But Cartesian grids may also be employed when 
the domain boundary is given precisely, and cuts in an arbitrary way through 
the grid. An application of the Cartesian approach in the hyperbolic case is 
given by Coirier and Powell (1995), Deister, Rocher, Hirschel, and Monnoyer 
(1998) and Ye, Mittal, Udaykumar, and Shyy (1999), and reviews may be 
found in Henshaw (1996) and Berger and Aftosmis (1998). 


11.2. Three types of grid 469 





Fig. 11.2. Cartesian grid for tidal flow in estuary with tidal flats. Only dry cells 
are shown. 


Advantages and disadvantages 


The advantage of structured grids over unstructured grids is, that datastruc- 
tured are simpler and no indirect addressing is required, leading to smaller 
computing times. Furthermore, fast multigrid iterative solution methods are 
easier to use. On the other hand, in complicated three-dimensional domains 
the construction of unstructured grids is far easier and requires far less time 
from the user than the construction of boundary-fitted structured grids, 
which may require weeks of work. This seems to be leading at present to 
a growing popularity of unstructured grids. In such grids the cells need not 
be triangles, as in Fig 11.1(c), but can be quadrilaterals as well. In three 
dimensions we may have tetrahedrons, hexahedrons, prisms etc. 


Of the three kinds of grids listed above, Cartesian grids are least popular. 
But they are easiest to generate and away from the boundaries discretization 
is also the easiest. Accurate discretization near the boundaries is more diffi- 
cult than for the other two options. Nevertheless, interest in Cartesian grids 
seems to be growing; see for example the references just quoted. 


Structured boundary-fitted grids seem to dominate at present, and we will 
concentrate exclusively on this approach. Much of the analysis that follows 
is also useful for the unstructured case. 
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11.3 Boundary-fitted grids 


Boundary-fitted coordinate mappings 


In order to generate a structured boundary-fitted grid, a coordinate trans- 
formation 


T:2 = 2(€) (11.1) 


is constructed, that maps a computational domain G onto a physical domain 
N. G is a rectangle (or a rectangular hexahedron in 3D). Here æ = (x1, z?, x?) 
are Cartesian, and € = (€!,€?,€%) are general curvilinear coordinates. The 
mapping is such that the boundary 092 consists of segments where £*% = 
constant for some a. How to generate T will be remarked upon later. Fig. 11.3 
gives an illustration of the principle. Fig. 11.4 gives a realistic example. 


xh EA 


—» x! —> E! 


Fig. 11.3. Mapping of physical domain on computational domain. 


Orthogonal coordinates 


By a £°-coordinate curve we mean a curve æ = æ(Ẹ) in 2 along which €? 
= constant, 8 # a. In other words, along a £°-coordinate line €% is the only 
-component that varies. If the angle of intersection of every E°- and E8- 
coordinate curve with 3 Æ a is orthogonal, the -coordinate system is called 
orthogonal. Note that here the -coordinate curves are considered in 92; in G 
they are orthogonal per definition. 


Conformal mapping 


We consider the two-dimensional case. Let z = £! + iz*,w = €' + i€?, with 
i= /—1. A mapping 


z = f(w) (11.2) 
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Fig. 11.4. A boundary-fitted grid around an airfoil. 


£! = Re{ f(E + i€?)}, £? = Im{ f(E} + i€)} 


is called conformal if f is analytic. The name conformal derives from the 
fact that in the small figures retain their shape under (11.2); they are merely 
rotated and stretched a bit. In particular, angles between intersecting curves 
are preserved. As a result, the -coordinates are orthogonal, like x. Conformal 
mapping is a classic topic in complex function theory, and was an important 
tool in fluid dynamics before the arrival of the computer. It is restricted to 
two dimensions. 


In general, the equations of motion become more complicated when €° are in- 
troduced as independent variables, but this is hardly the case if the -system 
is orthogonal. Hence one might think that conformal mapping is an attractive 
way to generate boundary-fitted grids. But this is not so, because it leaves 
little freedom to put grid points where we want them. So conformal mapping 
is little used, 


Orthogonality is a weaker condition than conformality, as illustrated in Exer- 
cise 11.3.1. However, it turns out that mappings that are merely orthogonal 
are also not sufficiently flexible in practice. But it is quite feasible to generate 
grids that are orthogonal at the domain boundary only. This facilities accu- 
rate implementation of the boundary conditions. But we will consider only 
the general case in what follows. 
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Domain decomposition 


A non-singular mapping T (11.1) exists if 2 and G are topologically equiva- 
lent. This means that one can obtain G from R by continuous deformation 
without cutting. But even when this is not the case, a suitable mapping can 
be constructed by introducing suitable cuts in 92, cf. Exercise 11.3.1. If the 
mapping remains complicated and the resulting distribution of grid point 
undesirable, the number of cuts may be increased, subdividing 2 in subdo- 
mains, each of which is mapped separately on its own rectangle G. The do- 
mains may overlap. The conditions by which the solutions in the subdomains 
are connected to each other are subject of much research. Apart from grid 
generation in domains of complicated shape, other reasons to apply domain 
decomposition are: 


— Zonal method: different mathematical models are applied in different sub- 
domains, in order to reduce computing cost. 
— Parallel computing: each subdomain is handled by a different processor. 


Domain decomposition will not be further discussed. 


Generation of boundary-fitted grids 


Grid generation is a large subject by itself. We will say little about it, but refer 
the reader to the following monographs: George (1991), Knupp and Steinberg 
(1993) and Thompson, Warsi, and Mastin (1985). Recent reviews are given 
by Henshaw (1996) and Spekreijse (1995). Excellent sources of information 
on the Internet are R. Schneiders’ Mesh generation & grid generation on the 
Web site: 
www-users.informatik.rwth-aachen.de/"roberts 
/meshgeneration.html 
and P. Heckbert’s collection of mesh generation links: 
www.cs.cmu.edu/~ph/mesh. html 
and the Grid technology site of Mississippi State University: 
www.erc.msstate.edu/research/thrusts/grid/ 
Here pointers to the literature and software packages may be found. 


Piecewise multilinear coordinate mapping 

Let the physical domain 92 be topologically equivalent to the unit cube G. 
In G we have Cartesian coordinates € = (€1, £7, £) and a uniform grid Ghr 
consisting of grid points located at €;, j = (j1, j2, Js): 


Gr = {Ẹ; : EF = ja AE“, ja = 0,1,...,1/AE*, @ = 1, 2,3}. (11.3) 
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where 1/Aé® € N. Greek indices are used exclusively to refer to coordinate di- 
rections, and vice-versa. Unless stated otherwise, summation is implied exclu- 
sively over pairs of equal Greek sub- and superscripts in terms and products. 
This summation convention does not apply to (11.3). The two-dimensional 
case is subsumed under the three-dimensional case by restricting the range 
of a to {1,2}; in this case G is of course the unit square. 


Boundary-fitted grid generation is a numerical process that generates the 
coordinate mapping 


T:æ; =æ), ENR, € €Gr (11.4) 


only in the discrete set of points Ga C G. In order to obtain accurate dis- 
cretizations, we have to be precise about how the mapping (11.4) is extended 
to all of 2 and G. This is necessary in order to discretize correctly near 
grid nonuniformities, and to compute cell volumes and cell face areas exactly 
and in a consistent manner. The extension should preserve the one-to-one 
property of the mapping, and allow exact and efficient computation of cell 
volumes and cell face areas. 


We recall from Sect. 3.7 that if the mesh size jumps, the local discretiza- 
tion error for second derivatives is of first order for vertex-centered and of 
zeroth order (which makes the scheme inconsistent in the maximum norm) 
for cell-centered schemes for the convection-diffusion equation (Manteuffel 
and White, Jr. (1986)). This has made some believe that grids need to be 
smooth for accurate results. But this is not so. In Forsyth, Jr. and Sam- 
mon (1988), Manteuffel and White, Jr. (1986), Weiser and Wheeler (1988), 
Wesseling (1996b) it is shown that the global discretization error is second 
order on strongly nonuniform grids. These results for the convection-diffusion 
equation may be expected to carry over to the Navier-Stokes equations. This 
is fortunate, because it allows us to switch abruptly from a fine mesh in thin 
boundary or shear layers to a coarse mesh outside. For such grids, in Farrell, 
Miller, O’Riordan, and Shishkin (1996), Roos, Stynes, and Tobiska (1996), 
Wesseling (1996b) it is shown that the accuracy is uniform in the Reynolds 
number, for the convection-diffusion equation, as we saw in Sect. 4.7. In order 
to allow abrupt changes in mesh size, we will assume the coordinate mapping 
to be merely piecewise differentiable. We will therefore extend the mapping 
by multilinear interpolation. 


Exercise 11.3.1. Let N be the concentric annulus between r = 1 and 
r = 2, with r = (zle! + 2?2?)'/?, Let £1 = r, € = 0 with (r, 8) cylin- 
drical coordinates. Determine the form that relation (11.1) takes in this case, 
and show that the image G of 9 is a rectangle. Determine where N has 
been cut. Show that the mapping is not conformal by comparing the angle 
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of intersection y of two curves 6 = constant, z! = 1 in the 2-plane (answer: 
p = 1/2 — 0) and in the -plane (answer: p = arctan(cos 8/ tan 8)). 


Exercise 11.3.2. Show that for piecewise multilinear coordinate mappings, 
cell edges are straight in physical space. 

Hint: a cell edge has the property that only one component of € varies along 
it. 


11.4 Basic geometric properties of grid cells 


A detailed account will be given of accurate and efficient. computation of cell 
volumes, cell face areas and cell face normals. This is of obvious importance 
for finite volume discretization. This elementary topic has of course been 
discussed before; see the excellent review of discretization in physical space 
by Vinokur (1989), and references quoted there. But the topic still warrants 
attention, because (in three dimensions) many authors define a cell face to 
consist of two plane triangles, see for example He and Salcudean (1994). 
This is less satisfactory than the doubly-ruled surface that we will use. The 
resulting formulae are not more economical, and a face can be subdivided into 
triangles in two ways, and two adjacent cells have to fit, which complicates 
grid generation; furthermore, the normal is ill-defined. Therefore we feel that 
the approach taken here, in which the mapping is extended by multilinear 
interpolation, so that cell faces turn out to be doubly-ruled surfaces, is to be 
preferred; the more so because the resulting expressions for areas and volumes 
are cheap, as will be seen. 


Two-dimensional case 
Let 92; E€ 92 be a geometric figure to be further defined shortly with vertices 


Bi-—e,—e2; Bjt+e;—e2; Vj—-eytez, Bj+eiterz, “1 = (1/2, 0), €2= (0, 1/2) 


(since the vertices have integer indices, this implies that jı and jz are frac- 
tional). The image of 92; in G is a rectangle called G;; G; and 22, are called 
cells. The cell G; is defined by (cf. (11.3)) (no summation): 


Gy = {€ = (C8) 2 € [ia — SAE, (a + SAE 


Let € be some point in G;. Bilinear interpolation in 9; gives the following 
relation between x and Ẹ: 


æ = £0 + Cal” — £5) + crl" — Eo) (€? — E) . (11.5) 
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Since this mapping is different in every cell, it is piecewise bilinear. It is 
also continuous. The edges of G; are straight and on every edge, £! or £? 
is constant. This makes the mapping (11.5) linear at the edges, so that the 
edges of 92; are also straight, as indicated in Fig. 11.5. The coefficients x9, Ca 


a E! 
1 5) 1 2 


Fig. 11.5. Grid cell in physical space (92;) and in computational space (G;). 


and c12 follow from the requirement #(€,) = @m, m = 1,2,3,4, where 
the subscript m refers to the vertices enumerated in Fig. 11.5, and are easily 
determined as follows. Choose £o in the center of G;, and define new variables 
s* by 
laa . 
E° = EF + 5 Ab s* (no summation), 

so that (11.5) becomes 

æ = æo + bas? + bi28's" , (11.6) 
with 

1 a 1 1 2 : 
ba = 7^4 Co, b = gas AE cı2 (no summation). 


By requiring #(€m) = #m, M = 1,2,3,4, we obtain the following system of 
equations: 


Ay=r, (11.7) 
with y = (b1, b2, by2, 20)", r= (#1, £2, %3, a4)? and 
—1 -l 11 
1-1 —i i 
A= } 1 1 1 
—İ l1 —1 1 


The columns of A are orthogonal. Premultiplication of (11.7) by AT gives 
y= ŁAT r, so that we have, defining amp = @m+2n: 


bı 
biz 


4(#23 — æ14), b2 = (234 — £12), 
q(®13 — %24), @o = 7(#12+ ®34). 
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Geometric quantities 


For later use, we need to determine various geometric quantities related to 
the coordinate mapping. 


The Jacobian of the mapping is defined by 


J 3 det{ 5) = Fr ® 5a (11.8) 
where the operator & is defined as 
a@b=a'b? -ab . 
It is assumed that 
J>0, (11.9) 


i.e. the æ- and €-coordinate systems are right-handed. 


By the definition of the outer product, in three dimensions the vector a x b 
has magnitude equal to twice the area of the triangle spanned by a and b; 
hence, in two dimensions a Q b with a and b ordered counterclockwise equals 
twice the area of the triangle spanned by a and b. It follows that the cell area 
is, with the vertex enumeration of Fig. 11.5, 


l 
| 2; | = 5 (3 — #1) O (£4 — æ2) ; (11.10) 


where we have taken advantage of the identities a & a = 0 and a & b = 
—bQa. The vector 3smn normal to a cell edge x, — æn with length [æm — æn! 
will be called the cell edge vector. For complete specification, we add the 
requirement, assuming £* = constant on £m — Œn, that S35mn points into the 
cell where £°(3 Æ a) is largest. It follows that (cf. Fig. 11.5) 


2 2 2 
_ Ly — £4 — t3 — t2 
$12 = 1 1 J» $23 = 1 1 ; 
Ty — Ly Lo — T3 
(11.11) 
r? — zr? 2 n? 
— 4 3 _ U4 — Ty 
834 = 1 1 jJ; $14 = 1 1 . 
t3 — T4 ty — t4 


Later, we will need the so-called contravariant and covariant base vectors. 
The covariant base vectors a.) are defined by 


_ Oz a _ ôx? 
Me) = Fea or (a) = gga? 


with at.) the Cartesian z°-component of the vector (a). We put parentheses 
around a to emphasize that no component is intended. Because æ = aw (€) 
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is piecewise bilinear, a.) is piecewise continuous. In the cell interior and 
on edges £°= constant, 8 £ a, @(q) is continuous, but aa) is discontinu- 
ous at cell edges €* = constant. For instance, agı) is discontinuous at the 
cell edge #3 — æ> (cf. Fig. 11.5). We will need a.) only in the cell center 
fc = (x4 + Œ + #3 + #4) and at those cell edge centers where Aq) Is 
continuous. For mnemonic convenience, cell edge centers are indicated by 
subscripts N, W, S, E, where N stands for north etc., so that for instance 
en = 4(#3 + a4). In a we have in (11.6) st = s? = 0, in ey we have 
s! = 0, s? = 1 etc. It is easily seen that we have exactly 


anc = (æg —æw)/Aft, aayc = (an — ws)/AE?’, 
aajyn = (æ3 — æ4)/ A£}, aajys = (æ — #1)/Aé}, (11.12) 
aje = (æ3-— @2)/AE?, ayayw = (#4 — 21)/AE?. 


The contravariant base vectors are defined by 


(a) — 3E“ 
B T @r8 ’ 





a = Vé* or a 


(a) 


where a,” is the Cartesian z°-component of the vector a'*). We have 


a'®) ars) = 65 , 


with 63 the Kronecker delta. Solving this equation gives 


Ifa 1 [{ -a? 
(4) — (2) (2) — (1) (11.13) 
a) = , al = ; . 
vI (2 5 ( afa) ) 
where ,/g is the common designation in tensor analysis for the Jacobian J 
defined in (11.8), i.e. 


Vg = @(1) © aiz). (11.14) 


From the smoothness properties of aq) it follows that /g, a!) and Vga!) 


are discontinuous at cell edges, except that ,/ga‘®) is continuous at cell edges 
of the type €* = constant. From (11.13), (11.12) and (11.11) it follows that 
sga) is closely related to cell edge vectors: 


(Jga a = sg/AẸ, (yga )w = sw/Aé?, 
(yga) )y = sy/Aé}, (/ga)5 = s5/Aé!, 


where sy = 834 etc. 


Not surprisingly, ,/g is closely related to the cell area. We have, using (11.14), 
(11.13) and (11.10), 


Jac Aé' AE? = (æg — ew) ® (en — #5) = 


1 
-= q(@2 + ws — æ — w4) @ (æ3 + æ4 — æ ~ æ) = Nj. 
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Geometric identity 


According to the divergence theorem, we have for a function f(z): 


f saajan = [ mar, 


2; T; 


where n is the outward unit normal. Choosing f (æ) to be constant, this gives 


[nar =o. (11.15) 
T; 
This is called the geometric identity. It expresses the fact that the contour I; 


is closed. It is desirable that (11.15) is satisfied exactly when I; is approxi- 
mated numerically. We find indeed from (11.11): 


[ nar = 243+ 823-801-814 = 0. 
Pj 


This is a necessary condition for uniform flow to be an exact solution of the 
numerical scheme. 


Three-dimensional case 


In three dimensions, the mapping (11.1) is extended by piecewise trilinear 
interpolation. Let 9; be a geometric figure to be further defined shortly with 
vertices 
# jt(eiteates)s wPt(e1~ sates): “Ft (ertea es) 3 i 
Pit{-eitez+ea) E1 = (519, 0), Ez = (0, 5:0) €3 = (0, 0, 5) . 
Since the vertices have integer indices, this implies that jẹ is fractional. 
In the following, æ; will be a cell center, #;,., will be a cell face center, 
Litestes (B É a) will be a cell edge center, and #;4¢,4e,+e, will be a cell 


vertex. The image of 9; in G is a rectangular hexahedron called G;. The cell 
G; is defined by (no summation): 


Gy = {€ = (EEE): € [ja — FAL, (ja + SAE}. 


Let o be some point in G;. Then trilinear interpolation inside 2; gives the 
following relation between æ and &: 


æ = £o + Call” — ES) + c12(E* — EME? — EZ) + c23(€? — ERNE? — 63) 


+ c13(€° — &)(€* — Eo) + craa(é* — EGNE? — ENE? — 6) . 
(11.16) 
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It follows that the edges of the cell 92; are straight. Consider a face of 12; 
with £? = constant. In this face the mapping (11.16) becomes, choosing £o 
in the corresponding face of G; so that we have €° = &@: 


æ = æo + cil! — £b) + clé? — &) + era(€? — ENE? —€), (11.17) 


with coefficients æọ and c in general different from those in (11.16), since Ẹo 
is changed. With £! = constant or £? = constant equation (11.17) describes 
straight lines, so that the cell face contains two families of straight lines, and 
is therefore a doubly-ruled surface. 


The following change of variables is convenient: 
1 
E% = ES + Ase s" (no summation) , (11.18) 


with £o in the center of G;. By rewriting the mapping (11.16) in the local 
s-coordinates we obtain: 


£ = Œo + bas + by28!8? + bigs! s? + bo387s* + biz3s's’s° , 
with (no summation) 
1 a 1 a B 
ba =5 46 Co, bag = 4486 AE Caps 


(11.19) 
b123 = ZAG AE Ag e123 . 


Requiring æ = æm, M = 1,2,...,8 (see Fig. 11.6) gives the following system 
of equations: 


Ay =r, (11.20) 


T 


and 


with y = (by, bz, bs, biz, bi3, b23, b123, wo)", r= (æ1, DP ee £g) 





l l 4 
Fig. 11.6. Three-dimensional cell. Its image in G is a rectangular hexahedron. 
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-l-l1-1 1 1 1-11 
-1 1-1-1 1-1 11 
1 1-1 1-1-1 -11 
a- | Pcl -lb-1-1 1 11 
-1 -1 1 1-1-1 11 
-1 1 1-1-1 1-11 
i 1121221 1 «212 
i -1 1-1 1-1-11 


The columns of A are orthogonal. Premultiplication of (11.20) by AT gives 


y= gr, 
so that 
by = 3(#3478 — @1256), bz = $(@2367 — 21458); 
b; = i (#5678 = #1234), biz = 5 (®1357 — £2468), 
big = 5 (#1278 — #3456), b23 = t (#1467 — £2358), (11.21) 
biog = $(æ2457— £1368), 
where 


Bjikmn = Bj H Ek Em + En. 


This completes the specification of the coordinate mapping in three dimen- 
sions. 


Geometric quantities 


For the Jacobian we can write 


Ow Ox Ox O@ 
J = det —} = — -(— x —). 11.22 
Again, we assume the æ- and €-coordinate systems to be right-handed, so 


that (11.9) holds. 


We begin with the cell volume. The volume of the parallelepiped spanned by 
three vectors a,b and c is a- (b x c). Hence, the volume element generated 
by infinitesimal] changes in € is 
Oe Ov O@ 
dQ = = + (a5 X az) dE dEeRdEP . 
p 061 (Hea * ge) £ £ 

This is positive, because right-handed coordinate systems are assumed, cf. 
(11.22) and (11.9). It follows that the cell volume is given by 
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|\Q;|= | dQ= | Jdé'dé*de? . (11.23) 
[oo | 


Nj 


Let us choose £o in (11.18) in the center of the cell G;. Then the ranges of 
integration in this triple integral are (£f — tAE”, Eg + 5 AE), a = 1,2,3, 
with AE® the length of the edges of G;. The following change of variables is 
convenient: 


E~ = EF + FAE%s* (no summation), (11.24) 
so that (11.23) becomes 


1 1 1 
Q;| = LAELAE? Ae Jds!ds?ds? . 
J 


~1-—1-1 


According to (11.16) we have 


Ow 

bet = cı + d128° + digs® + di238"s° , 

Ow 1 3 3.1 

JEZ = c2 +dz215 + dogs” + d2315°8 , (11.25) 
ð 

a8 = c3 + d31s' + d3zs” + dgi2s's? , 


where 


di2 = $AE en, di3 = AEP C13, diz3 = FAC? A€Per23 , 
dx, = 4AE'cy2, do3 = 4Ab% e23, d231 = GAC AE e123, (11.26) 
d3 = 4AE*ci3, d32 = 5AE? C23, d312 = 1AE! AE e123 « 
Because of symmetry (which is why o was chosen in the center), terms that 
contain odd powers of s® do not contribute. Neglecting these terms, equation 
(11.22) gives 
J ~ o1-(c2 x e3) + siste,: (do; x d3}) 
+87s*dj2 + (c2 x d3z) + s°s°di3 - (d23 X c3) 
+s's's?5*s°s%di23 + (d231 X d312) , 


where the symbol ~ indicates that terms have been neglected. Because the 
dagy vectors are parallel, the last term is zero. Integration gives, using (11.26), 


|92;| =AE* AEP AEF {c1 - (c2 X c3) + ZAE Atc: + (c12 X c13) 


(11.27 
+ ZAE? AE? c12: (c2 X c23) + 2 AE? AEF c13 + (c23 X €3)} . ) 
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We want to write (11.27) in terms of the location of the cell vertices. Substi- 
tution of (11.19) in (11.27) gives 


|2;| —8b, . (bə x b3) + SIb . (biz x bi3) 


11.28 
+ bız: (b2 x b23) + bis - (b23 x b3)} . ( 


This formula is exact. Before giving a more efficient expression we first give 
a formula for the area of a cell face multiplied by its outward unit normal, 
which will call a cell face vector. 


Consider the cell face 1234 (cf. Fig. 11.6). In this cell face the coordinate 
mapping can be written in the form given by (11.17). The vector normal to 
the face in a point æ in the positive €9-direction with length equal to the area 
of the parallelogram spanned by two infinitesimal displacement vectors dx (1) 
and dz 2) is given by dai) x dæ2), so that the average normal vector with 
length equal to the area of the face 1432 (which we call the cell face vector) 
is given by 


$1432 = f ndľ = f dæ x aeg. (11.29) 


1432 1432 


The indices are put in parentheses to emphasize that no components are 
intended. Introducing s* defined by (11.24) gives 


dz (.) = dé* (no summation}, 


Ow 
OE* 
with 

ð ð 

Je = ĉ&ı + dys”, Je = c + dos' ; 


where dı2 and d2; are defined by (11.26). Substitution in (11.29) gives 


$1432 = ragag f f2 x 2 {Z ds!ds? = AE! AE? cI x c3. 


-1—1 


The vectors cı and c3 are easily determined as follows. The transformation 
(11.17) can be rewritten as 


æ = o + t Attest + 4 AE? c28? + I AEAEE? e128" 8? . 


Requiring æ = æm, m = 1,2,3,4, gives a system of equations that is easily 
solved in the same way as (11.20). The result is 


] 1 
cj = Jaa 34 — #12), c2 = Jag”? — £14) ’ 
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where jm = 2j +@m. Using ax a = 0 this gives the following efficient form: 
$1432 = +(@4 — @2) X (£3 — z1), (11.30) 


which is identical to the formula for a plane quadrilateral. This equivalence 
holds only for doubly-ruled surfaces with straight edges. The general formula 
for a cell face vector is 


Smnpq = (En — Bq) X (@p — em), (11.31) 


assuming the cell vertices are numbered such that the diagonals are mp and 
nq and that the vector has a positive component in a direction of increasing 


£2 (cf. Fig. 11.6). 


It may be shown that 


Smnpq//P0 ; (11.32) 


with no the normal in the cell face center ao. It is easily seen from (11.17) 
that 


xo = (Em + En + 8p + £q). 


Geometric identity 


Let I; be the surface of the cell 2;. Since T} is closed we must have the 
geometric identity (11.15). As mentioned before, for uniform flow to be an 
exact solution of discretization schemes it is necessary that the geometric 
identity is satisfied exactly. Since our cell faces are smooth, n is defined 
unambiguously, and it is easily shown that 


where summation takes place over the cell faces constituting T}. We see that 
the geometric identity is satisfied. 


Economic formula for cell volume 


We can now write down a more economical formula for the cell volume. The 
cell face vectors are needed in the discretization. It would be nice if they 
could be reused to compute cell volumes. It would not be unexpected if the 
volume is related to face area times face distance. The vector 2b, (defined in 
(11.21)) connects face centers. We rewrite (11.28) as 
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|92;| =${b1 - (bz x ba + biz x by3) + bz - (b3 x By + Bo x bı2) 


11.33 
+ bg - (by x bz + by3 x b2a3)}. l ) 


After some manipulation we find 


1 
bz x ba = 35 (@23 X %67 + £67 X Esg + B14 X 23 + Esg X £14) 


and 
biz x big = + (£17 X Bog + £35 X Œ Bog X X Bag X 235) 
12 13 = z (t17 28 35 17 + €38 46 + B46 35) ; 
so that 
bı - (bz x b3 + biz x by3) = aby - ($1265 + $4378) . 


Not unexpectedly, this is the average area of two opposing faces times their 
distance divided by 8. The other two terms in (11.33) can be handled in the 
same way, resulting in the following efficient formula: 


| 2; = x {by ` (81265 + $4378) + b2 - ($1584 + $2673) + ba - (81432 + S8765) } .- 
(11.34) 


Since Smnpq is needed anyway, as noted before, this is the most efficient 
formula for the cell volume. Equation (11.34) consists of parts that can be 
reused for adjacent volumes. For example, b4 = + (23478 — £1256), and #3478 
can be used both in 9; and 92;42-,. The same holds for s4378, and the other 
vectors in (11.34) can also be used more than once. A more complicated exact 
formula for the cell volume has been given by Davies and Salmond (1985). 


Exercise 11.4.1. Show that the volume of the parallelepiped spanned by 
three vectors a,b,c is a- (b x c). 
Hint: |b x c| is the area of the parallelogram spanned by b, c. 


Exercise 11.4.2. Prove equation (11.32). 
Hint: on a cell face , n// 3B x Ja. 


11.5 Introduction to tensor analysis 


The reader who is only interested in discretization schemes on colocated grids 
can skip this section. 


When working in a general coordinate system it is desirable to formulate 
the equations in an invariant form, i.e. a form that is independent of the 
coordinate system. For this purpose vector and tensor notation have been 
developed. Vector notation is a special case of tensor notation, and is less 
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handy for expressing higher order tensors. We therefore prefer tensor nota- 
tion. 


For an introduction to tensor analysis, see Aris (1962), Sedov (1971) or Sokol- 
nikoff (1964). Here we will give an introduction that covers the essentials 
required for formulation and discretization of physical laws in general coor- 
dinates. The reader is assumed to be familiar with vector analysis. 


11.5.1 Invariance 


At the basis of tensor analysis is the fact that a physical entity remains the 
same, no matter what coordinate system is used for its description. In other 
words, it is invariant under coordinate transformations. This means there 
must be a relation between descriptions in two different coordinate systems. 
This relation is studied in tensor analysis. Furthermore, physical laws do not 
depend on coordinate systems. They can therefore be expressed in a form 
that does not depend on the coordinate system used, and is invariant under 
coordinate transformations. Such invariant formulations can be derived with 
tensor analysis. A law that cannot be put in invariant (tensor) form cannot 
be physical. 


So invariance refers to independence of coordinate systems of physical objects, 
and, at the same time, of the formulation of their description and of the 
physical laws which govern them. To give an example, we define a vector u 
in Cartesian coordinates by 


u = uel!) + uzel?) + uzel?) ; 
with e!@) the unit vector in the z®-direction. This expression obviously holds 
only in Cartesian coordinates. With tensor analysis we will find the following 
description: 


u = Ual!) -+ Uzal?) + Uzal?) . 


This formula holds in any coordinate system, with suitable definitions of U« 
and al®), 


Coordinate transformations 


We will restrict ourselves to Euclidean space. This means that the space 
can be described by a Cartesian coordinate system æ = (z1, z?°, 2°). We 
will always use æ to denote a Cartesian coordinate system and Greek let- 
ters, such as €, to denote general coordinates. All coordinate systems will 
be right-handed and coordinate transformations will be one-to-one, so that 
the Jacobian of a coordinate transformation 7 = (€) satisfies, as already 


discussed in Sect. 11.4, 


486 11. Discretization in general domains 





J>0, (11.35) 
with 
=| Re =|% sa (0) 
~ [OE €2,&) | [OE] OE! \8E aE 
Sn On One (11.36) 


Cans 
Y 
D y 
iai 
V N 
œ% 
Can 
now 


w 
w 
(eni kes] 
w 


Caa 


I] 
oS Ga 
mk oF 
mS IS 
on ry 

RR, 
wry 


Coordinate transformations satisfying (11.35) are called admissible. 


Summation convention and notation 


In order to simplify notation, tensor analysis always uses the summation con- 
vention. This was used before but is repeated here: 


Summation takes place over Greek indices that occur once as a 
subscript and once as a superscript in a term or product. 


Since nothing changes when an index over which summation takes place is 
replaced by another letter, it is called a dummy index. Other indices are called 
free indices. 


A vector will be denoted by lower case Latin letters, e.g. u, its components in 
a Cartesian coordinate system are denoted by ua or u% (ug = u%, of course), 
and its components in a general coordinate system are denoted by U® or 
Ua. We will see how U® and Ux may be deduced from u shortly, and find 
Ua FU". 


When a vector (or, more generally, a tensor) is defined everywhere in the 
domain as a function of location we have a vector field. For brevity we will 
just call this a vector, and similarly for tensors and scalars. 


When we write an expression or equation with free indices, for example 
6° Ug = V%, we mean 6°°U, = V%, a = 1,2,3. Free indices in terms in an 
equation must correspond. For example, U® = Va is not allowed, neither is 
U* = V®, Any free indices must be present in every term in an equation. 
Furtherrnore, the only way in which an index may be repeated in a term or 
product is as a single superscript-subscript pair. Hence, VU? is not allowed. 
These syntactic rules help to guard against mistakes. If they are violated in 
an equation, it is usually not a tensor equation. 
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Contravariant and covariant vectors 


Let n = n(€) (i.e n% = n° (£1, £7, €9)) be an admissible transformation from 
a general coordinate system € to a general coordinate system n. Let U%(&) 
be a set of three functions of £1, £2, £3, which transforms to another set of 
three functions U8 (n) in n-coordinates. If we have the following relations 
fe = E ya 
BEX” 
then U is said to be a contravariant vector (or contravariant tensor of order 
1). The contravariant nature is expressed by the index being a superscript. 
The order is the number of indices. The quantities U® , Ong /0€* and U® are 
functions of position in the underlying space. It is understood that they are 
evaluated in the same position in equation (11.37) and all other equations. 


(11.37) 


If Ua(Ẹ) is again a set of three functions, but it transforms according to 
Og 

= —— Ua, 11.38 

B dnp ( ) 


then Ua is said to be a covariant vector. The covariant nature is expressed 
by the index being a subscript. 


The special thing about these transformation rules is that they are linear, 
homogeneous and transitive. We show this for (11.37). Linearity follows from 





an? An? an? 
—!_ (pe a) Wn Ure at ye, 
gen U EVO = Beal” + Bea 
Homogeneity follows from, with c a constant, 
On? | tray — OM? pra 
pea CU ) = EZM . 


These two properties show that a linear combination of tensors is a tensor, 
making it possible to formulate tensor equations. For transitivity, consider 
a third coordinate system Ç. Suppose rule (11.37) is satisfied also when we 
go from n- to ¢-coordinates. Transitivity means that the transformation rule 
(11.37) is also satisfied when we go directly from € to ¢. This follows easily, 


ous Q 
since we have, denoting by Ų the components in ¢-coordinates: 
* AC 5 — OC% An? y _ 96° 


la — _ % n -S r 
U =m = En TE 


Invariance of tensor laws 


We assume that all transformations are admissible, so that the corresponding 
Jacobians are positive. Hence the determinant 
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B 
det (Fah =—JF0, 
so that in (11.37) we have 
U*=0 4> Ù’ =0. 
Let two contravariant tensors satisfy 
cU%+dV*% =0, (11.39) 


with c and d constants. It follows from linearity and homogeneity that the 
left-hand-side is a tensor. From J Æ 0 it follows that in another coordinate 
system 


cU* + dV" =0, 


and from transitivity that (11.39) holds in every admissible coordinate sys- 
tem. Thus we have obtained the invariance concept that we want: 


Lemma 11.5.1. (Ricci) 
If a tensor law is valid in one coordinate system, then it is valid in every 
coordinate system. 


This means that it is not necessary to say what the coordinate system is when 
we write down an equation like (11.39). Ricci’s lemma is a powerful tool, 
because it allows us to verify the correctness of a tensor law by inspecting it 
in a special coordinate system, such as a Cartesian system. 


General tensors 


The transformation laws (11.37) and (11.38) are not the only laws that are 


. . 81 OO . 
linear, homogeneous and transitive. More generally we have: Tay, s 8 
Bg 











mixed tensor of rank p+ q and weight w if its components Thee in 9- 
coordinates and Tai as in -coordinates are related by: 
Frap — ðE w On™ On 0g ðE’: Te (11 40) 
PaPa On| BEN | BEY InP AnBa 19a? ` 





where w is a real number, and Ey is the Jacobian defined in (11.36). This 


is called a mixed tensor, because it has both covariant (subscripts) and con- 
travariant (superscripts) indices. It has order p + q, and contravariant order 
p and covariant order q. If w Æ 0, the tensor is called a relative tensor with 
weight w; if w = 0 the tensor is called absolute. 


The product of two tensors is a tensor, for example UUP is a contravariant 
tensor of order 2. A vector is a tensor of order 1. 
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Scalars 


A tensor of order zero (no indices at all) is called a scalar, From (11.40) 
follows the transformation law for a relative scalar of weight w: 


5 = {2 
= (an 


w 


(11.41) 





The simplest example of a scalar is a function of position in space. Neither 
the space nor the position of a point P change as we change coordinates from 
Ẹ to 7, so that 


(np) = plẸp) ; 


or 


which is what is meant when we write 
P= P, 


so that according to the transformation law (11.41) this is an (absolute) 
scalar. It follows immediately that the value of a relative (w # 0) scalar in 
a point in space changes as we change coordinates. One might think that 
such a quantity has no physical meaning, because its value depends on the 
coordinate system chosen. However, this is not so. We will see later that the 
mass flux in a flow field is relative vector. The reason is that the length scales 
associated with coordinate increments change under coordinate transforma- 
tion. 


The Kronecker tensor 


From now on we write 65 for the Kronecker delta. It is defined by 
6;=1 if a=, 6; =0 if aff (11.42) 


in every coordinate system. Let (11.42) hold in €-coordinates. We try to see 
if the transformation law (11.40) holds. We have 


ja On" OE? oy _ ON™ OE” _ ON ga 
6 OEY PnP OT BEY ANF Bnp P? 


since 7', n*, 7° vary independently, so that ðn% /ðnf = 0 if a + 8. Comparison 
with (11.42) shows that 7 is indeed an absolute mixed tensor of order 2. 
According to Exercise 11.5.3, 6° and dag are not tensors. 
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The infinitesimal displacement vector 
Let dæ be an infinitesimal displacement vector. Its Cartesian components dz“ 
and its components in a general €-coordinate system d£* are related by 
ðr” 
dr = — dé? . 11.43 
sak (11.48) 


We see that the components of an infinitesimal displacement vector dê® trans- 
form like a contravariant vector (or tensor). 


The gradient 


Let v(x) be a differentiable function. The Cartesian components of its gra- 
dient are Op/O0xz%. We have 


de _ HP dp 
Oz® Ax IEP 





so that #4 is a covariant vector. This shows why it is natural to use subscripts 
for covariant components. 


Exercise 11.5.1. Show that the Jacobian of the product of two transforma- 
tions equals the product of the Jacobians of these transformations. 


Exercise 11.5.2. Show that the following transformation law 


æ w On 8 
Ved 968 


is linear, homogeneous and transitive. (Hint: use Exercise 11.5.1). 
Exercise 11.5.3. Show that dag and 6°? are not tensors. 


Exercise 11.5.4. Express U% in terms of u“ for cylindrical polar coordi- 
nates. 


Exercise 11.5.5. Show that 0°—/0€*0E* is not a tensor. 


11.5.2 The geometric quantities 


The geometric quantities are fundamenta? quantities related to the transfor- 
mation from Cartesian to general coordinates. 
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The base vectors 


The base vectors have already been introduced in the preceding section; we 
repeat their definition here. The contravariant base vectors are defined by 


al) = grad ¿° . (11.44) 


In other words, the Cartesian components of al®} are given by 


a) = OF 
B = 9x8 


Note that a‘®) is a vector and not a vector component. The parentheses serve 


to distinguish between al) and aig) Ha) will be introduced presently. For 


. . ct . . 
the same reason al®) is not a contravariant vector; a‘ ) are its Cartesian 


components. It is called the contravariant base vector because it is written 
with a superscript, which is natural because of its definition (11.44). It follows 
from (11.44) that al®) is perpendicular to coordinate surfaces €“ = constant. 


The covariant base vectors are defined by 


Ox 
Ba) = Jee ; (11.45) 


which means that its Cartesian components are given by 


It follows from (11.45) that aa) is tangential to coordinate lines along which 
¿° varies and £f, 6 # a is constant. Fig. 11.7 gives a geometric interpretation 
of the two kinds of base vectors in two dimensions. Note that, unlike the 
customary base vectors in a Cartesian coordinate system, the length of ai?) 
and a@(,) need not be unity. 


It follows that the contravariant transformation law (11.37) in the case of a 
transformation from Cartesian to general coordinates can be written as 


UÊ = aur = al) au, (11.46) 
uf = at U° Or u= a(q)U% , (11.47) 


which tells us how the Cartesian and contravariant components of a vector 
u are related. In fact, we could have taken (11.46) as the definition of the 
contravariant components of the vector field u in general coordinate system, 
from which the transformation law (11.37) would have followed. Similarly, for 
a transformation from Cartesian to general coordinates the covariant trans- 
formation law (11.38) can be written as 
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a 
a® 
42) 
ai i 
a 
a) 
Fig. 11.7. Covariant and contravariant base vectors. 

Ug = afp Ua = ag) u, (11.48) 
ug = aUa or u=alVU,. (11.49) 


We see that U? and Ug are two different representations of the same vector 
u; all that is involved is a change in base vectors in which u is expressed, 
according to (11.47) and (11.49). 


As suggested by Fig. 11.7, the contravariant and covariant base vectors are 
mutually orthogonal. We have 





al) . avg, = 83 , (11.50) 
since 

Et Oa" _ OE 

Ox? IEL GEP ` 


Equation (11.50) allows us to determine al®) if Qa) is given, and vice-versa. 
For example, let a.) be given. Then we have the following three equations 
for the three components of a): 


a.a = 6h, B=1,2,3. (11.51) 


The matrix of this system is (a@(1) @(2) ays)", where (@(1) @2) @(3)) is 
understood to be the matrix with a(.) as columns, and its determinant is the 
Jacobian for a mapping æ = #(€) given by (11.22): 


vi = 


This quantity was formerly called J and was assumed to be positive; why we 
now prefer to call it \/g will becomes clear later. Using Cramer’s rule, the 
solution of (11.51) is easily found to be 





Ox 
= = a(1) + (@(2) X @3)) = |a) a2) ays] - (11.52) 


1 
(1) — 
a = —(a xa . 
Z. (2) (3)) 
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[he general relation is 
1 
(a) — — i 
a'™ = ap X a , a, 8,y cyclic. 11.53 
Ti (8) (y)) Y ( ) 


Here cyclic means that a, 8, y should be an even permutation of 1, 2, 3, Le. 
1, 2, 3 or 2, 3, 1 or 3, 1, 2. 


The permutation symbol 
The permutation symbol € 4g, is defined as follows: 


0, if any two of a, 3,7 are equal 
Expy = $ 1, if a, 8,7 is an even permutation of 1, 2, 3 (11.54) 
—1, if a,8,y is an odd permutation of 1, 2, 3 


For example, €122 = 0, €231 = l, €213 = —1. 


The vector product c = a x b of two vectors a and b can be expressed as, in 
Cartesian coordinates, 


Ca = Eapya” b? . 


Hence 


a(a) * (41g) X A(y)) = E50n4a) 05) tta) - 


Let us define €g,. to be a tensor with components defined by (11.54) in 
Cartesian coordinates. If this quantity is indeed to be a tensor we must have 
according to (11.40) in -coordinates 


ðr? Ax? deh w 
Eapy = Pak JE OEP BET OTH T (V9)" (a) *(@(a) x Ay). (11.55) 


According to (11.52), the right-hand-side equals tyg” if a, ĝ, y are an 
even or odd permutation of 1, 2, 3, respectively, whereas it equals zero if any 
two vectors in (11.55) are the same, so that if w = —1 we have indeed, as 
desired, 


Eapy = Eapy » 


which shows that £€agą, is a relative tensor of weight —1. Similarly, ¢%°7 is a 
relative tensor of weight 1. Absolute variants are /géag 7 and ae *Py, 


va 
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The metric tensor 


The length ds of an incremental displacement vector dæ satisfies, using 


(11.43): 


2 3. dx® ax% B B 
— . — L Iae Y — 
ds? = dæ - dæ = 2, JEF DE dEP dE = gp, dEP dE” , (11.56) 
with 
3 
Ox) Ox 
gap = Dy Bee OER T Ma)” Me) ° (11.57) 
y= 


We cannot use the summation convention in one member of (11.56) and 
(11.57) because we sum over two superscripts. It is easy to see that gag is an 
absolute covariant tensor of order 2: 


“O27 Ox? _ > Be? BEX OE BE? OEH AE? 
y=1 


jap = > SE = SEE OE OE 
y=] ane on 





BEX One BE? An ~ An InP Sue » 


so that the transformation law (11.40) holds when we go from €- to 
n-coordinates. The tensor gag is called the metric tensor, because it relates 
distance to infinitesimal coordinate increments, according to (11.56). Note 
that gag is symmetric: 


Jap = gpa. 
Contravariant and mixed versions are defined by 
g° = al) * alô), 95 = al®) . aip) . 
Because of (11.50), 
08 = 65 


Cartesian coordinates correspond to the mapping æ = €, so that in Cartesian 
coordinates 


Jaa = g°* =1 (no summation), 
Jap = 9°" =0, aß. 


The determinant with elements gag is called g. We have, from (11.57), 


(11.58) 


g= |(@(1) ap) aga) (aa) ag) a)l = lag) ag a@|?, (11.59) 


where |(@(1) @(2) @(3)| is understood to be the matrix with aq) as columns. 
Equations (11.59) and (11.22) explain why the Jacobian of a mapping 
æ = «(€) (from general to Cartesian coordinates) is called \/f. 
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Contraction 


Contraction is summing over a pair of indices of a tensor or tensor product. 
It can be shown that this gives a new tensor with the same weight but the 
order reduced by two. For example, 


Tg? =V, 9° Tp, = S, 


with V® and S$ tensors. Note that g°"T;, is a tensor of order 4. It is left to 
one of the exercises to show that 


9°" gyp = 59. (11.60) 


Raising and lowering indices 


From (11.46) and (11.49) it follows that 
Ue — aff). alU, — g?°"Us ; 
Similarly, 
Us = gapU" . 


This shows how the metric tensor is used to raise or lower and index. More 
generally, 


Bry. Apr Ny a PY¥1 1. Ky 
g Ti. Be = T . 


Poea 
It follows that dummy indices can be lowered and raised at will, e.g. 
TP Ng = TS NP. (11.61) 


The inner product 


Using (11.47) and (11.49) we can write 

u-v = afa) : aP UCV. 
Using (11.50) this becomes 

u.v = UVa = Ua V°. 


Physical components 


If n is some unit vector (length one, direction arbitrary) then the projection 
of a vector u on n (i.e. un) is called the physical component of u in the 
direction n, and denoted by up(n). We have 
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up(n) = U°N, = Ua N®. 


In a Cartesian system ug is the physical component in the z°-direction, but 
in general Ua is not the physical component in the £%-direction. The unit 
vector parallel to a(,) for some a is 


N = A(a)/|@(a)| = @(a)/VJoa (no summation), 
so that 


N? = g8/ gaa (no summation), (11.62) 


and 


up(n) = Ua/VJaa (no summation). 


Similarly, the covariant representation of the unit vector parallel to a®) for 
some a is 


Ne = 93/Vg%* (no summation) , (11.63) 


and 


up(n) =U" //g** (no summation). 


We will also need physical components of second order tensors, in order to 
evaluate the shear stress at a wall, for example. In Cartesian coordinates, in 
a fluid the stress component in the 2°-direction on a surface element with 
unit normal n is given by 


f° = tF ng ’ 


with t° the (symmetric) stress tensor. The physical stress component in the 
direction of a unit vector m is given by 


f-m= Mat? ng . 


Because this is a tensor equation, equating two scalars, according to Ricci’s 
lemma it holds in every coordinate system: 


f-m=MaT°PONg . 


Suppose we want to determine the physical shear stress at a wall along which 
E? = constant. The unit normal to the surface is given by (11.63) with a = 3. 
This gives 

f- m= MAT? /V/933. 


For the component perpendicular to the wall, Ma is given by (11.63) with 
a = 3, resulting in 


f-m=T?/V/9. 
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Christoffel symbol 


The remaining geometric quantity of importance is 


da 
a, _ (a) (8) 
=a .§——. , 
tsy gy 
which is called the Christoffel symbol (in the €-coordinate system). We have 


a,  d&* x? 
loy = Dar DEFOE 


so that 


a a 

tat = bal 
There are 18 Christoffel symbols in three dimensions and 6 in two. The 
Christoffel symbol is not a tensor (we will not show this), which is why we 
use the notation with curly braces and not something like '%, which would 


suggest a third order tensor. The role played by the Christoffel symbol will 
become clear later. 


Exercise 11.5.6. Prove (11.61). 
Exercise 11.5.7. Prove (11.60). 
Exercise 11.5.8. Prove (11.62) and (11.63). 


Exercise 11.5.9. Prove that a coordinate system is orthogonal if and only 
if gag = 0 for 8 fa. 


Exercise 11.5.10. Show that the angle 6 between €% and € coordinate 
lines satisfies cos 9 = gag/,/Jaagse (no summation). 


Exercise 11.5.11. Determine ,/g for cylindrical coordinates. 


Exercise 11.5.12. Find the relation between U® and u in cylindrical coor- 
dinates, and determine the physical components. 


Exercise 11.5.13. Cylindrical coordinates £% are defined by xz! = £t, 
z? = E? cos £3, r? = E? sin €3. Show that the only non-zero Christoffel sym- 
bols are: 


a) = ~€? ' Go) = 1/é? : 
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11.5.3 Tensor calculus 


The covariant derivative 


Operations such as taking the divergence or the curl of a vector field, or the 
formula for the stress tensor in a fluid, involve taking derivatives of vector 
fields. In Cartesian coordinates we define the following notation: 


Ua, B = OUg/OXz . (11.64) 


In general coordinates we want to define Ua,g and U% such that these quan- 
tities are tensors and reduce to (11.64) in the case of the identity mapping 
æ = €. It turns out that 0U,/0€° is not a tensor (cf. Exercise 11.5.14). We 
have, assuming that Uag will be defined such that it is a tensor, using (11.40), 


Ax? Ax 

age ah o 

Or" ð Of 

ae DEP (Dae ~) 

Ax? BEY Uy Oz? 8 O | 
BEX Bae BEF t BEX DEF Bea YY = 
OU, O2° ð El 
Gent ab ae Gas) 


Uap = 


(11.65) 





where we have assumed that U, is an absolute tensor. Equation (11.65) is 
not very elegant, because the second term involves a mixed derivative with 
respect to both æ and €. We can eliminate this mixed derivative as follows: 
Ox? O° oe") _ ð (on Oey, oE ð? rI 
OEL IEP OxI” OEP OEL Ax? Oz? OE“ DEP 


_ ða y 
= a. ger = -GA , 











so that we find 


MU 
Uas = gee — guy . (11.66) 





This is called the covariant derivative of the absolute tensor U®. In a similar 
way we find 


US = = + tgp U" , (11.67) 


and for absolute second order tensors: 
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oT? a B 
aß _ 6B ad 
Tye = EE ET + (Bare (11.68) 
OT? ó 
Toy = ðE + {Tp — pT ’ (11.69) 
OT. 6 ô 
Tasa = Gey T h Te gT: (11.70) 


The usual rules that hold for differentiation also hold for the covariant deriva- 
tive, such as 


(UUP), = UPUS + UU, (11.71) 


as is easily seen. In fact, this follows immediately from Ricci’s lemma, since 
(11.71) obviously holds in Cartesian coordinates. 


The covariant derivative of the metric tensor 


In Cartesian coordinates g*? is given by (11.58), so that in these coordinates 
gt? =0. (11.72) 
Because this is a tensor equation, equation (11.72) holds in every coordinate 


system, according to Ricci’s lemma. Proving (11.72) without Ricci’s lemma 
would be a tedious affair. 


Suppose U% = 0. We have, using (11.72), 
Ua,e = (Gar"),6 = ga g » 
hence 


Uap = 0) . 


The covariant derivative of a constant vector 
Let c be a constant vector field. Then 
X u 
Cg = 0 
holds in Cartesian coordinates, and hence in general coordinates we have 


C% =0. 


500 EL. Discretization in general domains 


The derivative of a scalar 


For an absolute scalar yp we have y(é) = y(€(x)), so that 
P,a = Op/0E° 
is a covariant vector. 


A useful identity that we will not demonstrate here is the following: 


0/9 _ b 
aa = VI) (11.73) 





This clearly does not transform like a tensor, so that the covariant derivative 
of relative scalars is not given by (11.73) but by something else. More gen- 
erally, the covariant derivative of relative tensors is defined differently from 
(11.66)—(11.70), but will not be needed here. 


Divergence, curl and second order differential operator 


We have, using (11.73), for the divergence of a vectorfield: 
divu = U% = 0U%/d€* + {gue = ——(/gU*). (11.74) 


Similarly, 
poe — 1 OgTe? 
6 Tg age 


In Cartesian coordinates, the curl of a vectorfield u, called w = curlw, is 
given by w® = €°PY ug q. In general coordinates this becomes 


+ {tT . 


1 
R= ge Ven , (11.75) 


where we have inserted the factor 1/,/g (=1 in Cartesian coordinates) in 
order to make {2% an absolute tensor if U, is absolute. Because this notation 
is common in fluid dynamics we make an exception here, denoting a vector 
by the Greek letter 92. Because (11.75) is a tensor equation and is true in the 
Cartesian case, it holds in general coordinates. Equivalent forms of (11.75) 
are 


1 1 
Qe = ae gag U2, = Fae 9060") 4 ; 


vg 
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In Cartesian coordinates the general second order differential operator Ly 
working on an absolute scalar p is given by 


Ly = (af pa) ,6 » (11.76) 
Define the vector field b by 
bP = aon, 
then 
Lp = div b, 


which shows that Ly is a scalar. Hence (11.76) is a tensor equation, and holds 
in general coordinates. Using (11.74) we can write 


1 ð Op 
Lg = — = op Fy 
In order to obtain the Laplacian V’y in general coordinates we cannot re- 
place a°? by 6%°, because 6° is not a tensor, as shown in Sect. 11.5.1. But 
aP = g®® does the job in Cartesian and hence in general coordinates, so 
that in general coordinates the Laplacian is given by 
1 ð Op 

Vp = ay (Va f =). 11.77 
11.5.4 The equations of motion in general coordinates 
Using the rules of tensor notation, tensor calculus and Ricci’s lemma, the for- 
mulation of the governing equations of fluid dynamics in general coordinates 
is surprisingly easy. It suffices to write down syntactically correct tensor laws, 
that are valid in the special case of a Cartesian system. The equations in a 


Cartesian system have been presented in Chap. 1. They can almost be copied 
verbatim, except that here or there an index has to be raised or lowered. 


General coordinate systems 


The invariant form of the mass conservation equation (1.13) is found to be: 


ay + (PUN) « = 0 
Using (11.74) this can be rewritten as 
Op 1 ð ay 
at J age V9PU )=0 
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The momentum equation (1.24) takes the following general form: 


dpU 


apt (PU UP) g= 18 + BFE , (11.78) 


with F£ the contravariant representation of the body force. For the invariant 
formulation of (1.25) some raising of indices is required, and the Kronecker 
symbol ag (not a tensor) must be replaced by g°%, cf. Sect. 11.5.1. The 
invariant expression for 7°? follows from (1.25): 


1 
r° = —pg?? + 2u(e%? — ar ); 


1 
of — _(gPV[J% ary a A= — qU* 
The invariant form of the energy equation (1.40) is found to be: 


oO 1 


t a ae (VgpU* E) = (Uar®®),9 + (ko Tp),a + PUaFE + p4 - 


(11.79) 


Exercise 11.5.14. Prove that 0U,/0£° is not a tensor. 
Exercise 11.5.15. Prove (11.67). 


Exercise 11.5.16. Show that the 3x3 determinant with elements bag equals 
Ea791062863,, With Eagy the permutation symbol. 


Exercise 11.5.17. Derive the Laplacian in cylindrical and spherical coordi- 
nates, starting from (11.77). 


Exercise 11.5.18. Show that the heat conduction term in the energy equa- 
tion (11.79) can be written as 


(k ag OT 
= (k./9g ahaa ay. 
Exercise 11.5.19. Take u = constant and use (11.74) to show that 


0 
pea (V30) = 0 


12. Numerical solution of the Euler equations 
in general domains 


12.1 Introduction 


The purpose of this chapter is to generalize the methods described in Chap. 10 
for computing inviscid compressible flows in one dimension to more dimen- 
sions and general domains. Although no new principles will emerge, this 
generalization is not completely straightforward, except for the JST scheme 
(which is its great charm), which will therefore not be discussed here. 


We will mostly discuss the three-dimensional case, leaving the two-dimen- 
sional case for the reader to work out. The domain {2 C R? is arbitrary, and 
is assumed to contain a single block boundary-fitted structured grid with 
hexahedrons as cells, as discussed in Chap. 11. In practice the shape of the 
domain is often so complicated, that domain decomposition (cf. Sect. 11.3) 
has to be used. Domain decomposition is also frequently used to create sub- 
tasks for parallel processing. Conditions to couple subdomain solutions and 
iterative methods to iterate to global convergence have to be devised. Intro- 
ductions to the mathematical background of domain decomposition are given 
in Chan and Mathew (1994) and Smith, Bjgrstad, and Gropp (1996). We will 
not discuss domain decomposition. 


For reasons to be explained in Chap. 14, the methods to be disussed in 
the present chapter do not work well in the (almost) incompressible case. 
A unified method for the compressible and the incompressible case will be 
presented in Chap. 14. 


12.2 Analytic aspects 

This section runs parallel to Sect. 10.2; here we discuss only extensions 
brought about by multi-dimensionality. 

Euler equations 


The Euler equations in Cartesian coordinates (1.76)—(1.78) can be written as 
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U, + af*(U)/de*=Q, 2EN, tE (0,7), (12.1) 


where we sum over a € {1,2,3}, and where 


p m” 0 
m? u*m! + pô? phy 
U=| m |, fX= | ume + pio |, Q= pf? , (12.2) 
m? u*m? + pds phy 
DE me m+ ff 


where m® = pu“, and 6? is the Kronecker delta. This is the conservative for- 
mulation. We will also need the nonconservative formulation with dependent 
variables p, u and p. For the momentum and energy equations we take the 
inviscid and nondiffusive versions of (1.27) and (1.43), respectively. We use 
the perfect, gas law e = -H8 to eliminate e from the energy equation, which 
becomes: 

Dp _ pDp . 

E_E (1 u + (y—1)pq. 

Di p Dt (1 — y)pdivu + (y — 1)p¢ 
Using the mass conservation equation, Dp/ Dt is replaced by —pdivu. The 
nonconservate formulation of the Euler equations becomes: 


ie + pdivu = 0, (12.3) 
Du |1 

Di + p8radP = fo, (12.4) 
Dp . 

Tr + ypdivu = (y — 1)øq . (12.5) 


Homogeneity of the flux function 
Suppose the equation of state is given by 
p = pg(e) . 
Note that the perfect gas law is a special case. Since e = e(U) = {Us — + (U2 + 
U2 + UZ)/U1}/U1, we have e(uU) = e(U), with p an arbitrary real number. 


In the same way as in Sect. 10.2 it is easily shown that the flux functions are 
homogeneous of degree one: 


Fe (pU) = wf (U), 
and that we have 


f*U)=F°U)U , F° =0f*(U)/AU . 
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The Jacobians of the flux functions 


The Euler equations (12.1) can be rewritten in nonconservative form as 
U,+ F°0U/dc* =Q. (12.6) 


According to Sect. 2.2 this is a hyperbolic system if there are 5 linearly 
independent real eigenvectors R with real eigenvalues A for the following 
generalized eigenproblem: 


(ng F% —AM)R=0. (12.7) 


To settle this we will study the generalized eigenproblem (12.7). 


It turns out that the eigenvalue problem is more tractable if we transform to 
the following nonconservative variables: 


p u! 
W=[ul, u=| xv]. (12.8) 
p ue 
By defining Q = 0U/OW and writing 


p 
U = | pu 
Æa + Gpuu 


(where we have used the perfect gas law pe = p/(y — 1)) we obtain: 


1 0 0 100 
Q= u pl 0 , F=t01 OF . (12.9) 
zuu pur 001 
The inverse is found to be: 
1 0 0 
Q= | —-u/p I/p 0 


uu ~(y—1)u? y-1 


We expect that the eigenvalue problem (12.7) is simplified by similarity trans- 
formation with Q, as in the one-dimensional case (cf. Sect. 10.2). The trans- 
formed eigenvalue problem becomes: 


(ngF°-ANR=0, F°=Q-IF°Q, R=QuIR. (12.10) 


However, we do not wish to undertake the labor of carrying out the ma- 
trix multiplications Q-1F%Q. It is easier to obtain F° by inspection of the 
nonconservative Euler equations (12.3)-(12.5). These can be written as 
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ƏV saV > 
att ape TA. 
with 
up 0 0 0 
0 ul 0 0 1/p 
Fi-~}00wu 0 0], 
000 zu 0 
0 yp 0 0 wi 
udp 0 0 
0 ur 0 0 0 
F = | 0 0 w? 0 I/p], 
0 0 0 uw? O 
0 0 yp 0 uw? 
uo 00 p 0 
0uw 00 0 
F=j,0 00 0 
0 0 0 wu 1/p 
0 0 0 yp uv 


The eigenvalue problem (12.10) can be conveniently formulated as 
(A-ANR=0, 
with 


mu pny png pnz 0 
0 nu 0 0 ni/p 
0 0 nu O0 neo/p 
0 0 0 nu n3/p 
0 ypny ypn2 ypn3 n'u 


The characteristic polynomial is easily found to be 
(n-u —d)*{(n-u— A)? enn}, è= yp/p, 
with c the speed of sound. This gives us 


Ap=nu-—cln|, =A= Mnu, As=nuteln|. (12.11) 


The eigenvectors 


For À = Az2,A3,A4 we find the following equations for the eigenvector 
R= (fi, se P): 


f5 =0, nız + nař3 + n3ř4=0. 
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Three linearly independent solutions are: 


ni n3 nə 
0 . -n2 . -n3 
Ř=| n |, R=| m |, R=| 0 (12.12) 
-n3 0 Thy 
0 0 0 


For à = À; and A = Az solutions are easily found by choosing a nonzero value 
for the fifth component and solving for the other components by backsubsti- 
tution. In analogy with the one-dimensional case (cf. (10.10)) we choose the 
values —pc/2 and pe/2 for the fifth component of A, and Rs, respectively, 
and we obtain: 


` —p/e ` 1 pfe 
Ri= | n/|n| |, R= 7 n/|n| ] . (12.13) 
—pe pe 


Because we have obtained a full set of eigenvectors for the generalized eigen- 
value problem (12.10), the system is hyperbolic. 


Characteristic surfaces and bicharacteristics 


As seen in Sect. 2.2, if we freeze F? locally then there are plane wave solutions 
of the type 


~ 


U=Ueil=-) Ú = constant . 


A plane defined by w(æ,t) = n- æ — At = constant is called a characteris- 
tic surface. The normal to the characteristic surface in (t, æ) space satisfies 
gradw=n, we = —A. Now we take F” variable, so that n and A are not 
constant, and define characteristic surfaces by 


w(t,æ) = constant, gradw=n, w= -A. 


As seen from (12.11), A = A(n} , and n is arbitrary. In every point there is an 
infinite family of characteristic surfaces. Their (local) envelope is called the 
Monge or Mach conoid, and the curve of tangency between a characteristic 
surface and the Monge conoid is called a bicharacteristic. The bicharacter- 
istics sweep out the Monge conoid. We will determine the bicharacteristics. 
Let (so, s) be a tangent vector to a bicharacteristic in (t, æ) space. Since this 
vector is tangential to the characteristic surface, we have (so, s): (—A,n) = 0, 
or 


sn=X(n)so. 


Since (sg, 8) is also tangential to the Monge conoid, it is tangential to the 
characteristic with n changed infinitesimally: 


308 12. Euler equations in general domains 


s:(n+én)=X(n+dn)so. 


Obviously, in these equations we can choose sg = 1. For A = n-wu these 
equations for the bicharacteristic direction become: 


s:n=u-n, s-dn=u-dn 
with solution s = u, so that 
(so, 8) = (1, u) . 


In other words, the bicharacteristics corresponding to 41,A2 and A3 are the 
particle paths. For A= n'u + c|n| we obtain 


sn=unteln|, s-{n+ôn)= (n+ ôn)u ecen + én], 


with solution s = u+cn/|n|. Hence, tangent vectors to a bicharacteristic 
corresponding to A, and As are, respectively, 


(s0, 8) = (1, u — en/|n|), (s0,8) = (1, u + en/|n]). 


Boundary conditions 


Information travels along bicharacteristics. The number of boundary condi- 
tions to apply in a point P at the boundary is therefore equal to the number 
of different types of bicharacteristics that enter the domain at P for every 
possible choice of n(A). Let the inward normal to the boundary at P be m. 
If m-+u > 0, the bicharacteristics corresponding to Az, 3 and A4 enter; these 
are counted three times, so the particle path bicharacteristics bring along 
three boundary conditions. If P is at a solid wall (m-u = 0) or if there is 
outflow at P (m-u < 0) at most two boundary conditions are to be applied. 
Bicharacteristics corresponding to A; enter for all n if 


m-u—cm-n/|n|>0, Vn, 
which is the case if the normal inflow velocity is supersonic: 
m-u>c. 
Finally, bicharacteristics corresponding to As enter for all rè if 
mutem-n/|n|>0, Vn, 


which is the case if there is inflow or if there is outflow with subsonic normal 
component: 


m'u >c. 
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Having determined the number of boundary conditions to prescribe, the ques- 
tion arises what kind of boundary conditions to prescribe. This is restricted 
by the so-called compatibility relations. These hold in characteristics, and give 
rise to the Riemann invariants in the one-dimensional case. The multidimen- 
sional case is more complicated. Compatibility relations will not be considered 
here; see Hirsch (1990), Chap. 16 for an extensive discussion of compatibil- 
ity relations and boundary conditions. Some guidance in selecting boundary 
conditions may be obtained from the one-dimensional case (Chap. 10), where 
the availability of Riemann invariants makes it easy to determine which com- 
binations of boundary conditions are correct. 


For a solid wall, it follows from the criteria given above that precisely one 
condition is to be given. This should be the normal velocity component. At 
a supersonic inflow boundary all unknowns must be prescribed. At a sub- 
sonic inflow boundary, a good possibility is to prescribe the velocity and one 
thermodynamic quantity, for example temperature. At a subsonic outflow 
boundary, for the single condition to be prescribed one may select another 
thermodynamic quantity, for example pressure. 


Rotation of coordinate system 


In the next section we will find it convenient to rotate coordinates, for easy 
evaluation of the expression f*(U)n,, with n a unit vector. We start by 
studying rotation of coordinates. 


Let æ = (x1, 27, 2°) be a Cartesian coordinate system and let y = (y', y?, y3) 


be another coordinate system, defined by 
y =L ae, z= Ly (12.14) 


with L a nonsingular constant matrix. It is well-known that the transforma- 
tion (12.14) represents a rotation if and only if L is an orthogonal matrix, 
i.e. 


L=. 


This we now show. Obviously, the coordinate systems are share the same 
origin, and because the transformation is linear, the y-system is rectilinear. 
Denote the base vector along the y°-axis by aja). We have 


Ox ðr? 
— P — — 
Ba) 7 Oy , (a) ~ Oy ~ lga ’ 


i.e. [gq is the -component of a;q) in the #-system. We have a rotation if the 
y-system is again Cartesian, i.e. if the base vectors are orthogonal: 
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Ala) * Ap) = dap - 


This gives (the position of indices is irrelevant in this section, and the sum- 
mation convention is invoked for pairs of equal Greek indices) 


bya lye = das 5 
or 
LL? =1, 
which is what we wanted to show. 


We now turn to the Euler equations. Denote the velocity components in the 


-system by v = (v!, v?, v). We have 
y 


v% =U Aa) = wai) = uP’ lea , (12.15) 
hence 
v=Lu, u= Lv. (12.16) 


We wish to consider what happens to the expression f*(U)nq, given in the 
#-coordinate system, under a specific rotation of coordinates. Let the rotated 
y-coordinate system have the y-axis aligned with n: 


(1) =n. 


We do not care about the direction of the y?- and y°-axes, but if one wants 
to make a choice a possibility would be to take, denoting the base vectors of 
the x-coordinate system by e,q), 

@(2) = €(1) X @(1), (3) = @(1) X G2); 
provided e(1) # aq); otherwise no rotation is applied. Using (12.15) and 
(12.16) we obtain 


1 


pv 
f% (U)na = F*U)ahy = | pvu + pan) |, 
pvt H 
so that we can write 
f*(U)na = LF (V), (12.17) 
with 
_ 100 p 
L=;O0L0], V=ļ| p 
001 pE 


Equation (12.17) will be used later. 
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Exercise 12.2.1. Show that the coordinate system defined by (12.14) is 
rectilinear. 


Exercise 12.2.2. Show that if (12.14) is a rotation, then lag is the cosine 
of the angle between the base vectors €(,) and ag) of the original and the 
rotated coordinate system, respectively. 


12.3 Cell-centered finite volume discretization on 
boundary-fitted grids 


Integration over finite volume 


Finite volume discretization is carried out in the usual way. The Euler equa- 
tions (12.1) are integrated over the cell 92; depicted in Fig. 11.6. By using 
the divergence theorem we obtain, using the summation convection for Greek 
indices and the notation of Chap. 11, 


dU; a 
vira +/f Nod = |92;|Q; , 
F; 


with n the outer normal, and U; and Q; the volume averages of U and Q, 
respectively. The surface integral is evaluated by approximating f® by its 
value at the center of the cell face. Taking as an example the integral over 
the ‘east’ cell face, i.e. the cell face that separates 9; and 2;42e, (let us 
denote this part of T; by Tg), we get 


[ irads = Fg f nads = F2sar , 
Te Fre 


with the cell face vector sg defined by (11.30), and Fẹ an approximation of 
fE. For the ‘west’ face we get, taking the sign convection for the cell face 
vectors s into account, 


J f ngdS = — F Saw . 
Iw 


First, we assume a Cartesian grid, and generalize the schemes discussed in 
Chap. 10 to three dimensions. Then general structured grids will be consid- 
ered. 


Cartesian grid 


Consider the cell-face Tg, supposed to be perpendicular to the z}-axis with 
outward normal in the positive direction. We obtain 
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f Fraas = fras = Filsel, (12.18) 
Ie Fg 


with |sg| the area of Tg, and F} the numerical flux, which remains to be spec- 
ified. This can be done in a way that is closely related to the one-dimensional 
case, because f? and f’ do not occur in (12.18). 


With flux splitting schemes, F$ is obtained by splitting the flux f+, analogous 
to (10.83): 


Fr = ft(UL) +f (UR), 


with Uz, and Up states to the left and right of Pg. These states may be the 
states in the centers of the two neighbor cells of Fg, or extrapolated states 
at le in MUSCEL fashion. 


With approximate Riemann solvers, F! is obtained by specifying a Riemann 
problem at lge. Without loss of generality we assume t = 0. Then the Rie- 
mann problem to be used is given by 


au af 
ðt zl (12.19) 
U(0,2)=U,, z!<zh, U(0,2)=UR, z'>zh. 


This problem is similar to the one we had in the one-dimensional case, and ex- 
tension to the multi-dimensional case is relatively straightforward. The above 
Riemann problem is solved exactly (Godunov) or approximately (Osher) or 
is approximated by a linear problem which is solved exactly (Roe). Denote 


a i 1 
the solution obtained by U(=—=£). For the numerical flux we take 


FR = f'(U(0)). 


The Roe scheme 


Similarly to the one-dimensional case, with the Roe scheme the Riemann 
problem (12.19) is approximated by the following linear problem: 


ðU ðU afi(t) 


a t4 U 4= TU 





(12.20) 


The constant matrix A is the Jacobian of f? evaluated at the Roe-averaged 
state U. It suffices, as it turns out, to specify the averages of u and H: 


a VPLYL+yPRUR p VPLHL + Jprir 


VoL + yPR VoL + JiR (12.21) 
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The Riemann problem for (12.20) is solved in the same way as in the one- 
dimensional case. Let R1,..., Rs be the eigenvectors of A, and let 


5 


Up -Ur = ap.. (12.22) 
p=l 
Then we get (cf. (10.66): 
1 ix 
FR = z{f (Ur) + f'(Ur)} — 5 Do lAplap R - 
p=1 


The quantities A,,a, and R, are determined as follows. The eigenproblem 
for the Jacobian of f! follows from the eigenproblem (12.7) by taking n = 
(1, 0,0). From (12.11) we find 


1 


Mae ——, M= àsa, ApH +e. 


The averaged velocity u follows from (12.21). The averaged sound speed č 
is given by (10.70): 


The eigenvectors follow from R, = QR, with Q given by (12.9) and R, by 
(12.12) and (12.18). In order to beautify R, we apply a suitable scaling to 


~ 


R,, and choose, with R the matrix with columns Rj, ..., Rs: 
10 0 1 

00 0 c/p 

01/p 0 0 

0 0 1/p 0 

00 0 e 


Let R be the matrix with columns Ra, ..., Rs. From R = QR we obtain, 
evaluating in the Roe-averaged state: 


1 1 0 0 1 
ül—-ë al 0 0 l+ 
R= u? u 1 0 # . (12.23) 
Ti u? 0 1 u? 
H — cu! lu-u ū? u? H+ eal 


Let {RP} be the orthogonal set of row vectors satisfying R? R = 62. Then 
we get from (12.22) the following expression for ap: 


ap = R?- (Ur — UL). 


514 12. Euler equations in general domains 


To evaluate a, we need R?,...,.R°, which are the rows of R+. We note that 


0-£00 + 

g 1 0 00-4 

R=] 0 0 p0 0 . (12.24) 
00 0p 0 
0 £ 00 & 


yin? -4i i a 
ce ' 47 ze, 2ee 2G? 2g? 2c? 
1— in? yur que eG 
4 2, č? č? č? z2 
R= -āū 0 1 0 0 f|, (12.25) 
—ii3 0 0 1 0 
way? 2 3a? IS 3 
-z +4M o ea e Te Ie 


where 7 = y — 1 and M = u-u/é*. This completes the specification of the 
Roe scheme in the three-dimensional Cartesian case. The application of the 
sonic entropy fix of Harten (1984) follows easily from the one-dimensional 
case (cf. Sect. 10.3). 


The Osher scheme 


Application of the Osher scheme to (12.18) gives the following numerical flux 
(cf. (10.77)): 


1 


Fh = iU) + UR) - 5 [AW ao, (12.26) 


with A(U) = Of'(U)/OU. The procedure is similar to the one-dimensional 
case in Sect. 10.4. For an arbitrary hyperbolic system, the path would be 
divided in five parts I),..., 15, with 


dU dU 
qo 7 Te on Ti, I 


with Rp the eigenvectors of A. In the particular case of the Euler equations, 
where we have Àz? = A3 = Aq = ul, we need to divide the path in only 
three parts, as we will see. We choose I, (0 < o < 1/3), T (1/3 <a < 
2/3) , T3 (2/3 < o < 1) according to 


= R4 on If etc., 


dU dU 
JT = Rs on Ly, — € Span{ Re, Rs, R4} on D ; = 
o 


do do = Ry on Í}. 
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The eigenvectors R, are the columns of R in (12.23), omitting the overbars. 
The rows of R`}? are called RP; R`! is given by (12.25), omitting the overbars. 
We have 


Roe = 0, g=1,2,3,4 on Ti, 

o 

Ro = 0, g=1,5 on Ih, (12.27) 
a 

re = 0, g=2,3,4,5 on T3. 
o 


To draw conclusions from this we transform to the nonconservative variables 
W defined by (12.8), so that (12.27) gives for suitable values of q 


d 
Rigen <0. 
do 


with Q given by (12.9)). Remembering that R+, ..., R5 are the rows of R`}, 
and noting that RIQ = R-! with R`} given by (12.24), we easily obtain 
—£(du' — zdp) 
dp — àdp 
R-'QdW = R71dW = pdu? 
pdu? 


£ (dul + tdp 


In the same way as in the one-dimensional case we see that 


2c 
pp’ = const., u! — T const. on Ti. 


y — 
Furthermore, obviously, 


u? = const., u? = const. on Ij. 


Similarly, 
— 1 2c 2 3 
pp’ = const., u` + = const., u‘ = const., u? = const. on T3. 
y — 


On I, (12.27) gives 


u = const. , P= const. 


From these relations the states at the end points of the subpaths can be 
determined. The relations between u!, p, p and ¢ are the same as in the 
one-dimensional case. Using results from Sect. 10.4 we get 
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1 1 aot 1 
Ugj3 = Uj = “apl 
y-1 
o/3 = Fa dq Yi T Yo): €2/3 = @C1/3 , 


41/3 = Z0, %2/3= Z1, P2/3 = P1/3- 


Here a = c1/co, Yo = uo—2e0/(y—-1), by = ur +2e1/(y—-1), z = In(pp77). 
The pressure follows from (10.81). Furthermore, we have 


q _ 4,4 q yf — 
uija = Uĝ, U23 = Ul, q=1,2. 


This determines the integration path. The integral in (12.26) is evaluated in 
exactly the same manner as in Sect. 10.4. 


The van Leer scheme 


For the van Leer scheme we write 
F$ = f+ (UL) + f7 Ur). 


It easily seen that in the three-dimensional case a splitting f! = ft + f7 
satisfying (10.84) and the six requirements listed in Sect. 10.5 is given by 


(1+ M)? 
£ ft ((y — 1)M + 2) 

ft — u? fr , 
u? fr 
(FEP H2O- DAT} + PAP (uu? + uF u3) 
-&(1- M)? 
s fy ((y-1)M— 2) 

f= u? fy , 
u fr 


(yia PHUP — fp} + SAT (uu? + uu) 


where M = u!/e. 


The AUSM scheme 


The three-dimensional version of the AUSM scheme proposed by Liou and 
Steffen (1993) is given by 
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1 . 1 ee 
Fz = 5 (M+ [M|)eUz + 3M- |IMi)eUr + PÈ + PR., 


Me = M} +M}, 
1 1 
M*=+7(M+1)?, M|<1; M*=3M+ĮM]), [M[>1, 
1 0 
u! pt 
U = pe | u? , P*=| 0 ; 
u? 9 
H 0 


pt=2Q4M), IMI<i,, p#=50+|M|/M), M>1. 


Boundary-fitted grids 


On boundary-fitted grids the flux at a cell face Tg is approximated as follows: 


f foncds Snar | fas, 
Iz Ie 


where np is the outward unit normal at the center of lg. This can be rewrit- 
ten in a convenient form by introduction of a local Cartesian y-coordinate 
system, with the y!-axis aligned with ng. By using (12.17) we obtain 


nas | fas =Le f fl(vjds , 


where we have appended a subscript E to L to emphasize that the coordinate 
rotation is associated with Tg; for each cell face a different coordinate rotation 
is used. The advantage of coordinate rotation is, that f? and f? disappear, so 
that we have a close analogy with the Cartesian case, already discussed. The 
integral over Tg is aproximated in the same way as (12.18) for the Cartesian 
case: 


J P(V)dS = F$jsg], 
TE 


and Fi(Vi, Vr) is evaluated in the same way as in the Cartesian case. The 
final result is: 


J Fraas ~ eF}(Vı, Va)isel . 
rg 


Of course, the flux over the same face for the neighbor cell (Tw in that 
case) does not need to be evaluated separately, but equals minus the flux just 
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obtained, as in any conservative scheme. 


The semi-discretized scheme that results after spatial discretization can be 
summarized as follows: 


dU; 


[2 = X {Ljs|F*(V+, vyts, 
t g (12.28) 
+ — TT + 
Vitea = Lise Vite, y 


where j = (j1, j2, ja), €1 = (4,0,0), e2 = (0, 4,0), e3 = (0,0, 4). The index 
j E €a refers to the face of 9; in the +x%-direction. If MUSCL is not used 
then Ute, = Uj42e,, Uje, = Uj, otherwise these are the slope limited 
extrapolations from the neighboring cells to the cell face at j + €a in the 


obvious way. 


Exercise 12.3.1. Show that the Riemann invariants associated with the hy- 
perbolic system 


ðU 
— +éf'(U)/dz' = 0 
ðt 
are the entropy S, the velocity components u? and u?, and u! + 2c/(y — 1). 
Show that S, u? and u? correspond to a contact discontinuity. 


12.4 Numerical boundary conditions 


As we have seen, the derivation of numerical fluxes in the general case is 
effectively reduced to the Cartesian case by rotation of coordinates. Therefore 
it suffices to discuss numerical boundary conditions for the Cartesian case. 


Number of analytic boundary conditions 


Suppose the cell face Tg is at the boundary of the domain. The numerical 
flux at Ip is given by 


Fh = F!(Uc, Ug) 3 


where Uc is the state in the interior cell of which Ig is a face, and it remains 
to specify the state Ug at the outer side of Tg using the analytic boundary 
conditions and the scheme. Analytic boundary conditions have been discussed 
in Sect. 12.2. An extensive treatment of analytic and numerical boundary 
conditions is given in Chap. 19. of Hirsch (1990). 
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The Riemann invariants relevant to the choice of boundary conditions are the 
Riemann invariants of the hyperbolic system (12.3.1). It is left to the reader 
to show that these Riemann invariants and their corresponding eigenvalues 
A are given by 


ul — 2e/(y- 1), A=ul—e, 
S, =u, 
u? , A=u!, 
ue A=u', 


u! + 2c/(y—1), A=ulte. 


We assume that the outward normal on Ig points in the positive zt-direction. 
Then if \ < 0 the corresponding Riemann invariant is incoming, and if A > 0 
it is not incoming, and will be called outgoing. As seen in Sect. 12.2, the num- 
ber of analytic boundary conditions equals the number of incoming Riemann 
invariants. 


Numerical boundary conditions 


As said before, Ug is the state at the outer side of lg, and there may be some 
vagueness as to whether the desired boundary state is effectively enforced at 
Ir or in the cell center of a virtual cell at the outer side of Tg. In the case 
of a solid boundary it is important to ensure that the boundary condition is 
enforced precisely at the boundary, but in the case of an inflow or an outflow 
boundary the precise location of the boundary is usually of little consequence, 
so the precise location where the boundary condition is effectively enforced 
is also of little consequence. 


In the case of supersonic inflow, i.e. uh < —c, the above difficulty does not 
occur, since schemes based on approximate Rieman solvers or flux splitting 
all give 


Fr = f' (Ue), 


and it suffices to prescribe the full state vector Ug, which in this case is given 
by the analytic boundary conditions. 


In the case of subsonic inflow, ie. -c < uh < 0, there are four incoming 
Riemann invariants and four analytic boundary conditions are given. First, 
assume that the four incoming Riemann invariants are given: 


— 2 _ 4,2 3 _ 3 1 
Se= So, Up=ug, ug= ug, ug —-2cge/(y-lj=n. 


As said before, the state in the cell adjacent to Fg is denoted by Uc. To obtain 
a fifth condition, we may extrapolate the outgoing Riemann invariant: 


520 12. Euler equations in general domains 


up + 2ce/(y—1) = ub + 2ce/(y-1) = rz. (12.29) 


We find 





Using (10.18) we have 
e? = yp"! exp(S/cy) , (12.31) 


which enables us to find p from ¢ and S. The following boundary state is 
obtained: 


1 S 
{ce exp(—So/cy)}I-7 , 


PE = 
Mr = PEU, Me =peug, mb = PEU, 
1 1 
E)g = pe(——— c} + up‘ up). 12.32 
(PE) (ach + 3er ue) (12.32) 


In practice, usually the incoming Riemann invariants are not given, but four 
other quantities are prescribed, for example po and uo. If again we extrapolate 
the outgoing Riemann invariant, we obtain 


y-1 


9 (r2 ~~ ug) } 





PE = Po, ME = poto, CE = 
and (pE)g follows from (12.32). 


At a solid boundary the analytic boundary condition is u! = 0. To enforce 
this precisely at Ig, reflection is used: 


1 1 2 _ 4,2 3 _ 43 — 
Up=—-Uc, Up=ue, Up = UC, Sp = Sc. 


A fifth condition is obtained by extrapolation of the outgoing Riemann in- 
variant, according to (12.29). This results in 


2 





CE = (r2 + ub). 


Equation (12.31) gives 
1 1. 
pe = {7ce exp(—Sc/ey)}™~ , 


and (pE)g follows from (12.32). 


In the case of subsonic outflow, ie. 0 < uh < c, there is one incoming 
Riemann invariant, and one analytic boundary condition is given. Ideally, 
the incoming Riemann invariant is specified: 
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up —2ce/(y-l=n. 
Four additional conditions may be generated by extrapolating the outgoing 
Riemann invariants: 
Se = Sc, ub +2ceg/(y-— 1) = ue + 2cc/(y-1) =r, uh = us, 
u? = už. 


Then ul, and cg follow from (12.30), and the boundary state is obtained as 
1 2 
pe = {7°E exp(—Sc/ev)}*7 , 


1 1 2 2 3 3 
Mp = PEUR, Mg = PEUC, Mp = PEUC s 


and (pE)g follows from (12.32). In practice, often some other quantity is 
given instead of the incoming Riemann invariant, for example the pressure: 
pg = po. If we extrapolate the outgoing Riemann invariants we get in a 
similar way as above 


a 
pe = {poexp(—Sc/c,)}=* , ce = VyPo/PE , 
ub = rg—-2Qep/(y-1), upaus, uvp=ue, 


and (pE)g follows from (12.32). 


If, as sometimes happens, ul, is not known a priori, one does not know which 
of the above cases applies. One may then test on u}, instead of ub. 


Transparent boundary conditions 


If the domain is infinite, it has to be restricted artificially for numerical pur- 
poses by the introduction of an artificial boundary. Suppose that no analytic 
information whatsoever is available on the artificial boundary. Artificial nu- 
merical boundary conditions have to be given at the artificial boundary. In 
order to avoid artificial reflections of waves at the artificial boundary as much 
as possible, transparent boundary conditions may be specified. The simplest 
kind of transparent boundary conditions are: 


PE ZPC, ME=™MC, PE~- PC: 


For more discussion on transparent boundary conditions, see Bayliss and 


Turkel (1980), Giles (1990), Karni (1992), Roe (1989). 


Far-field boundary conditions for two-dimensional flow around air- 
foils 


Let us consider the flow around a two-dimensional airfoil in an unbounded 
domain. The undisturbed flow velocity is denoted by U% and the z!-axis is 
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taken in the undisturbed flow direction. There may be shocks at the airfoil, 
but sufficiently far from the airfoil there are no shocks and we have potential 
flow. Let (ut, u?) be the perturbation of the velocity with respect to the 
undisturbed flow. It is well-known (see for example Thomas and Salas (1986)) 
that if the airfoil carries lift, then far from the airfoil the perturbation velocity 
is asymptotically equal to 


1_ £ Ba 2. K Ba} i 
ôu = In zlel + B2z2z2 ? bu — On gle] 4 Br? ° B=V1— Mz, 
(12.33) 


where «x is the circulation. The relation between the circulation and the lift 
force L working in the x?-direction is 


L = KpcUon , 
as shown by Blasius in 1910 (cf. Sect. 72b of Lamb (1945)). 


It is clear from (12.33) that when 3 < 1 the influence of the airfoil extends 
over a large distance in the x*-direction. When £ is very close to zero the 
asymptotic analysis on which (12.33) is based needs to be refined (cf. Mur- 
man and Cole (1971)), but the preceding statement about the long-distance 
influence of airfoils with lift in transonic flow remains true. As a consequence, 
the distance from the airfoil at which the infinite domain is truncated needs 
to be sufficiently large. To give a rough idea, experience shows that if free- 
stream conditions are applied at the part of the artificial boundary where 
there is no outflow, then the boundary has to be at a distance of a least fifty 
chords from the airfoil, especially in the x*-direction. 


Since already at significantly smaller distances from the airfoil the solution 
shows little variation, choosing the computational domain so large implies 
a waste of computing resources, unless the size of the grid cells is increased 
drastically at increasing distance from the airfoil. But this usually has a detri- 
mental effect on the rate of convergence of iterative solution methods. 


An alternative is to choose the computational domain smaller, and not to ap- 
ply free-stream conditions on the non-outflow part of the artificial boundary, 
but to prescribe 


1_ 1 1 2_ 62 
u = U tóu , u? = fu", 


with ĝu” given by (12.33). Of course, this implies iteration on the lift force 
L. For further information, see Thomas and Salas (1986) and Sect. 19.3 of 
Hirsch (1990). 


12.4. Numerical boundary conditions 923 


Evaluation of the pressure at a solid boundary 


With cell-centered finite volume discretization, the state vector is obtained 
in cell centers, which are not located at the boundary. If for post-processing 
purposes certain quantities are required at the boundary, extrapolation has 
to be applied. This is the case, for instance, when lift or drag on a solid body 
are to be determined. For this the pressure distribution on the solid boundary 
is needed. 


The simplest method is piecewise constant extrapolation, in which the pres- 
sure at a cell edge on a solid boundary is taken equal to the pressure in the 
cell-center. However, this often implies wasting some of the accuracy that has 
been realized in the solution in the cell centers, especially when a higher order 
scheme (for example, of MUSCL type) has been used. The normal derivative 
of the pressure may be significant, making piecewise constant extrapolation 
inaccurate. As a simple example, consider the two-dimensional flow around 
a circular cylinder. Using cylindrical coordinates (r, 0), the non-conservative 
form of the stationary momentum equation in r-direction is given by 


Bed geet — Lo = —2P 
per ap T pr”. D0 


At the cylinder v, = 0, so that 


—pv3 —— 


Hence, where a solid body is strongly curved (r small), such as near the lead- 
ing edge of an airfoil, the normal derivative Op/On may be significant. 


Extrapolation of the pressure to a solid boundary may be done by using 
an estimate of Op/On obtained from the equations of motion. This is de- 
scribed in Rizzi (1978) and Sect. 19.2.4 of Hirsch (1990). An alternative that 
is somewhat simpler, especially in three dimensions, is multilinear extrapo- 
lation. We describe this in two dimensions with the aid of Fig. 12.1. With a 
piecewise bilinear boundary-fitted coordinate mapping, the solid boundary is 
approximated by a piecewise linear curve. The pressure is assumed known in 
A, B,C, D. The points of intersection of AD and BC with the solid boundary 
are called E and F, respectively. The pressure is assumed to be bilinear in 
the patch EFCD. We take advantage of the fact that p varies linearly along 
straight lines. This gives 


IDE]  |AE] _ |CF|  |BF| 








PE = PA 


Let H be the center of AB, and let the point of intersection of GH with CD 
be J. Linear extrapolation gives 
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E G F 


Fig. 12.1. Bilinear extrapolation of the pressure 


— p, CLl _ CH] 


Between E and G, G and F the pressure varies linearly. The three-dimensional 
case can be handled similarly, but is a bit technical, and will not be discussed 
for brevity. 


Free slip at solid boundary 


In an inviscid flow model, such as the Euler equations, at a solid boundary the 
normal velocity component is zero, but the fluid should slip freely along the 
boundary in a tangential direction. A condition of no-slip should arise only 
due to viscous effects, and not because of numerical effects in the numerical 
approximations to inviscid terms. Numerical schemes for the Euler equations 
that are to be used for the approximation of the inviscid part of the Navier- 
Stokes equations should therefore allow free slip. We will show that the van 
Leer scheme does not satisfy this requirement, which is why this scheme was 
developed further by Liou and Steffen (1993) into the AUSM scheme. 


We consider a two-dimensional flow in an unbounded domain. At time t = 0 
we have flow in the z?-direction with a slip line along the x-axis: 


u'(0,@) =0, p(0,2) = po = constant, e(0,a) = eo = constant , 
u’(0, æ) =u}, zi <0; u’ (0,æ) =u}, z!>0, 
(12.34) 


with uz and uh constant. From the equation of state it follows that p(0, æ) = 
Po = constant. It is easy to see that the exact solution is given by 


U(t, æ) =U(0,2x). 
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Let us choose a uniform Cartesian grid with step size h in the zt-direction. 
Since there is no z°-dependence there is no need to specify a step size or 
grid point labeling in the z*-direction. The cell-centers z; are chosen at z; = 
(j — 1/2)h, and cell-centered finite volume schemes take the following form: 


dU; lim 1 

res + n Ej+1/2 — Fij) = 0. 
At time t = 0 we have c; = co = constant, and M; = uł/ej = 0. The van 
Leer flux is found to be at time t = 0: 


0 
1 4co/ 
1 _ 0/ 
F;41/2 = g Poco u? _ u? ’ 
3 (uz)? = ACHEI 
so that at t = 0 we have 
0 
E OA {0 
r OO oh 
5 (Uz) — z(uR) 


Hence, the exact solution is not recovered, and the contact discontinuity gets 
smeared as time increases, in a similar way as in Fig. 10.16, corresponding 
to a large amount (O(h)) of artificial viscosity. A flux splitting scheme that 
gives no smearing of stationary contact discontinuities is the AUSM scheme. 
It is left to the reader to show this. The Roe and Osher schemes also give no 
smearing of stationary contact discontinuities. 


With the JST scheme, the second order artificial viscosity term is switched 
off at a contact discontinuity, because it is triggered by pressure variations 
only; we now see why the scheme has been designed this way. The remaining 
fourth order artificial viscosity is sufficiently small for the JST scheme to be 
applicable to computation of viscous flows. 


Exercise 12.4.1. Show that for the AUSM scheme we have dU;/dt = 0 
when the initial conditions are given by (12.34). 
12.5 Temporal discretization 


The semi-discretized system (12.28) is easily discretized in time by the same 
methods as considered for the one-dimensional case in Sect. 10.6 and 10.8. 
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Numerical stability 


For the derivation of heuristic numerical stability criteria, we are led by the 
same reasoning as in Sect. 10.6 to consider the following linear scalar conser- 
vation law: 


Ov Ov 
gi t Hagan =O. (12.35) 


In the spirit of von Neumann stability analysis, we have to assume a uniform 
grid and constant coefficients. Considering flux splitting schemes to be gener- 
alizations of the first order upwind scheme to the systems case, we discretize 
(12.35) in space with the first order upwind scheme, assuming pq > 0: 


dv; Ha 
re p” ~ vj) =0. (12.36) 


We carry out von Neumann stability analysis according to the principles 
presented in Sect. 5.6. The symbol of scheme (12.36) is given by 


a 1 . 
Ly (9) =- So ca(l— e7"), ca = fat /he « (12.37) 


Stability requires that —rL,_(0) € S for all 9 = (01,02,...,04), with S the 
stability domain of the time stepping scheme to be used, for all values of po 
that occur in the flow domain. We must interpret pa as being the largest 
eigenvalue of the Jacobian 0f%(U)/0U, i.e. 


Ha = |ual +e o 
with c the speed of sound. 


The stability results obtained in Chap. 5 are easily applied to the present 
case. Equation (12.37) can be rewritten as 


rÌn(0) = 5(8) + iy2(0) , 
. „a l 12.38 
6(6) =?) casa , 72(0) = $ casin da , Sq = sin’ 5 x- ( ) 
Comparison shows that this is equivalent with (5.34) if we define dy = ca, 
k = 1. This enables us to apply directly the stability results of Sect. 5.8. 


For explicit Euler discretization in time, Theorem 5.8.1 gives the following 
sufficient stability condition: 


Sica <1. (12.39) 


a 
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It is left to the reader to show that this condition is also necessary (cf. Exer- 
cise 12.5.1). 


In practice we do not have a uniform grid and a scalar equation, but a 
boundary-fitted grid and a system of equations. The question arises what 
to take for |ua| and ha in (12.37). The general form of the scheme is (12.28). 
Inspection of this equation shows that we have two candidates for ha, namely 


Ajte. = (Sjea! / |2; . 


Furthermore, (LF%);4., corresponds to a discretization of a one-dimensional 
Euler equation with 2°-axis perpendicular to the cell face [;4.;. The corre- 
sponding maximum wave speed is |uj+e,|+¢j+e,, where uj+e, is the velocity 
component perpendicular to [j;4-,. We define 


Caj = ((lul +)/P)jtes » 


and we replace (12.39) by 


T cag <1, Vj. (12.40) 


Higher order schemes 


Just as in one dimension, for better spatial accuracy than given by first order 
approximate Riemann solvers or flux splitting schemes, we may approximate 
the numerical fluxes F*(V+,V—)|5*¢% in the scheme (12.28) by the MUSCL 
approach. That is, we may use a slope limited scheme. The formulation of 
such schemes follows immediately from the one-dimensional case, and will 
not be given. 


In order to derive stability conditions, we follow the same heuristic reasoning 
as in the one-dimensional case (Sect. 10.8), and require the scheme to be 
stable for the following scalar conservation law: 


Op OF (p) _ 
a + du Ox =0. 


If we discretize in time by the explicit Euler method, then in Sect. 9.4 the 
TVD condition (9.123) is found, which is sufficient for stability: 


So A*Ujte, S1/M, A% = Tfh", (12.41) 


where M is a parameter that depends on the flux limiter used. For example, 
in Sect. 10.8 we found M œ 1.85 for the van Albada limiter. Furthermore, 
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u is the wave speed. Equation (12.41) has been derived for a uniform grid. 
Heuristic extension to systems is done in a way that is by now familiar by 
replacing u by the maximum wave speed: 


SOA (uf + lites < 1/M . 


As before, this is heuristically extended to boundary-fitted grids as follows: 


T caj <1/M, Vj, (12.42) 


a 


with ca; defined before (12.40). 


As we saw in Sect. 9.4, some higher order Runge-Kutta time stepping schemes 
inherit the TVD property from the explicit Euler scheme, and (12.42) con- 
tinues to hold. But if a time stepping scheme is used for which this is not the 
case, or if one wishes to work with a weaker stability condition, then one can 
forego TVD considerations and rely solely on von Neumann stability analy- 
sis, as in the one-dimensional case of Sect. 10.8. There a heuristic stability 
criterion for MUSCL schemes, that works well in practice, is obtained by 
requiring von Neumann stability for spatial discretization by the first order 
upwind scheme and by the «-scheme that corresponds to the limiter selected, 
for the scalar equation (12.35). As an illustration we extend the example of 
Sect. 10.8 to the multi-dimensional case. The symbol of the first order upwind 
scheme is given by (12.38). The symbol of the x-scheme applied to (12.35) is 
given by (cf. Sect. 5.6): 


rin (8) = 71(8) +i), nO =A- r) So cas, 


iad 


y2(0) = 2 cat — K)Sa + 1}sinĝa, Sa = sin? 59. » Ca = |PalT/Re - 


Comparision shows that the symbol of the first order upwind scheme is given 
by (5.34) if we take dy = cq and «x = 1. The symbol of the «-scheme is given 
by (5.34) if we take dy = 0. This enables us to apply the theorems of Sect. 5.7 
and 5.8. 


As in Sect. 10.8 we select as an example the SHK Runge-Kutta method given 
in Sect. 5.8. Theorem 5.7.2 shows that the symbol of the first order upwind 
scheme is within a circle with center at —a and radius a if $ ca < a. As 


noted in Sect. 10.8 such a circle is inside the stability domain of the SHK 
method for a < 1.3926, so that we obtain the following sufficient stability 
condition for the first order upwind scheme: 


X ca < 1.3926 , (12.43) 
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which is weaker than (12.39). We continue with the x-scheme. Using Theo- 
rem 5.7.4 we see that for the x-scheme —7L, (6) is inside the ellipse of Fig. 5.9 
if 

` Ca < ——— and Soe < —? 
- *= 9(1—«) - “= (2—K)V2 ’ 


where a = 2.7853, 6 = 2.55. With the van Albada limiter we have « = 1/2, 
so that we obtain the following stability condition: 


Soca < 1.2. (12.44) 


Comparing (12.43) and (12.44), we see that both the first order upwind and 
k = 1/2 scheme are stable if (12.44) is satisfied. As before, we extend this 
condition heuristically to the case of systems on boundary-fitted grids by 
replacing it by 


TÝ caj <1.2, Vj, 


ad 


with caj defined before (12.40). 


Final remarks 


We have discussed explicit temporal discretization. Sometimes implicit 
schemes are more efficient than explicit schemes. For example, if the mesh 
size of the grid is locally very small, stability may require an extremely small 
time step for explicit schemes, making implicit schemes more efficient. Fur- 
thermore, if a stationary problem is to be solved, iterative solution of the 
stationary problem may be more efficient than time stepping to steady state. 
For a stationary problem or a time step with an implicit scheme, a compli- 
cated nonlinear algebraic system has to be solved. Doing this efficiently is 
not a straightforward matter. Much research is going on in this area. Due to 
the large number of unknows arising in typical applications and the sparsity 
of the associated matrices, iterative methods are to be preferred over direct 
methods. Advances in efficiency and turn-around time are being made by 
algorithmic improvements and the efficient use of high performance paral- 
lel computers. To limit the scope of the present work, we will not go into 
this subject. In Chap. 7 a brief survey of iterative methods has been given. 
The application of multigrid methods is computational fluid dynamics is re- 
viewed in Chap. 9 of Wesseling (1992) and in Wesseling and Oosterlee (2000). 
An introduction to high performance parallel computing in fluid dynamics 
is given in Wesseling (1996a). Iterative methods related to domain decom- 
position are reviewed in Chan and Mathew (1994) and Smith, Bjgrstad, and 
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Gropp (1996). In these publications the reader will find ample references to 
the literature. 


Exercise 12.5.1. Show that (12.39) is necessary. Hint: take 6, = 1. 


13. Numerical solution of the Navier-Stokes 
equations in general domains 


13.1 Introduction 


Most of the introductory remarks made in Sect. 12.1 apply to the present 
chapter as well, and will not be repeated. First, the compressible case will 
be discussed. The inviscid terms can be discretized as in Chap. 12, so that 
only the spatial discretization of the viscous terms needs to be given. Then 
the incompressible Navier-Stokes equations will be considered. The methods 
discussed in Chap. 6 will be generalized from Cartesian grids to structured 
boundary-fitted grids. A unified method for both the compressible and the 
incompressible case will be presented in the following chapter. 


13.2 Analytic aspects 
Governing equations 


The compressible Navier-Stokes equations have been derived in Chap. 1. They 
consist of the equation of mass conservation (1.13), the momentum equations 
(1.24) and the energy equation (1.40). These equations can be gathered to- 
gether in the following system, which is nothing but the Euler equations 
(1.76)—(1.78) with viscous and heat diffusion terms added: 


U, + Of) (U)/ðza + 3g (U) ðr =Q, EN, tE(0,T), (13.1) 


where we sum over a € {1, 2,3}, where U, f% and Q are given by (12.2), and 
where the viscous flux gt®) is given by 


uoh + kT a 


Here Ta stands for 0T/0xr%, and o” is the stress tensor, defined as 


(cf.(1.25)): 
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1 1 
oP = pfe? — 25° uh), e% = Uh + ula). 


We see that g(%) /Ax% contains second derivatives of velocity and tempera- 
ture, and it turns out that the spatial operator in (13.1) is elliptic, making 
(13.1) parabolic. 


Boundary conditions 


Usually the Reynolds number is large, and the viscous terms come into play 
only in small regions (e.g. boundary layers) with high gradients. As a con- 
sequence, the compressible Navier-Stokes equations behave very much like 
the (hyperbolic) Euler equations, and boundary conditions should be chosen 
such that we retain a well-posed problem when p { 0 and k | 0. We therefore 
apply as far as possible the same boundary conditions as for the Euler equa- 
tions, plus some additional conditions to make the parabolic case well-posed. 
This implies that precisely four conditions are to be given at every point of 
the boundary. 


At an inflow boundary we have to prescribe u and two thermodynamic quan- 
tities, for example p and p; T follows from the equation of state. Based on 
our experience with the convection-diffusion equation in Sect. 4.2 we apply 
homogeneous Neumann conditions on the stress and the temperature at an 
outflow boundary: 


Ten? = pon", n°T, =0, (13.2) 


where n is the unit normal on the outflow boundary, pæ is the outflow pres- 
sure, and 


reb — gef por ; 
At a solid boundary the no-slip condition is applied: 
u = UD, 


where u, is the local speed with which the boundary moves; on a fixed object, 
us = 0. Furthermore, at a solid boundary a temperature condition is to be 
given. If the solid boundary is a perfect heat conductor, then the temperature 
of the fluid equals the temperature of the solid boundary: 


T=. 


If the solid boundary is a perfect heat insulator (adiabatic wall), then we get 
a homogeneous Neumann condition at the boundary: 


OT/dn=0. 
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If neither of the two above conditions apply, then the heat flow problem in 
the solid body has to be solved simultaneously with the flow problem, and 
the two problems are coupled by the condition of continuity of heat flux: 


(HT /ðn); = (uOT/On). , 


where the subscripts f and b refer to the fluid side and the body side of the 
solid boundary, respectively. 


At first sight, the outflow condition (13.2) seems to present a problem in 
the case of supersonic outflow, since when p < 1 we effectively prescribe 
the pressure, which is forbidden in the Euler case. However, if the inviscid 
part of the equations is discretized with an approximate Riemann solver or 
a flux splitting scheme this presents no problem, because then the scheme is 
of fully upwind type near a supersonic outflow boundary, which implies that 
the scheme automatically disregards the outflow boundary condition. 


For further remarks on boundary conditions, see Sect. 6.5. 


13.3 Colocated scheme for the compressible 
Navier-Stokes equations 


Integration over finite volume 


Finite volume discretization is carried out in the same way as in Sect. 12.3. 
Integration over the cell 2; shown in Fig. 11.6 gives 


dU; oe. a 
Io; + | O° +9%)modS = [Q51Qs. 
T; 


The surface integral of f can be approximated in the same way as in 
Chap. 12, and does not need to be further considered here. We continue 
with the surface integral of g*. Taking as an example the integral over the 
‘east’ cell face, we write 


[ otnaas = GESaE ; 
re 


with sg the cell face vector defined by (11.30), and G% an approximation of 
JE: 
The path integral method 


Gù has to be expressed in terms of surrounding cell center values of U. For 
this we need approximations of first order spatial derivatives of u% and T 
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in terms of surrounding cell center values. A flexible and accurate way to 
do this is provided by the path integral method (van Beek, van Nooyen, and 
Wesseling (1995), Wesseling, Segal, Kassels, and Bijl (1998), Wesseling, Segal, 
and Kassels (1999)). Denoting the gradient of T by VT we have 


Tite; 
b = THH = / VI -de=VTj4e,-ea), eq = æt. (13.3) 


T; 


Remembering that Tp is between 92; and 2;42e,, we see that VJTj4e, is the 
quantity that we wish to approximate. We see that (13.3) gives a relation 
between VT and surrounding cell center values T; and T;42¢,. In order to 
determine VT in three dimensions, we need two more such relations. These 
may be obtained by choosing two additional non-parallel integration paths. 
For accuracy, these are chosen symmetric with respect to 2;4.¢,. We choose: 


Bi+2e5 Bip+2e, +2e9 


= j+2e2 G+2ert+2e2 _ 
b = Thie, + T 5421-26, = f + / VT -dz 
Dy 2eg © jte —2eg 
~ , — „|j t2ez J+2e1+2e2 
= VT; pe, . C(2) 4 C(2) = The, + Tẹle 2, . 
Similarly, 
Titles Tite] —2ey 
— TPitzes J+2ei+2e3 _ 
bs = T; zes + T 54 2ei—2es = J + / VT dæ 
Py Jeg @i42e) —2ey 


j+2e3 12 2+ 2ert2es 


VTj+e1°C(3)1 (3) = Elh zes + lipaze zes - 


Il 


We now have three relations expressing VT in terms of surrounding cell center 
values of T. This system of three equations is solved for VT by defining 


cle) = Cig) X c(4)/C , C= C1)" (c(2) x c(3)) ' 


with a, 8 and y cyclic. Then the solution for VT is given by 


oT 
aro = cb, . 


The same procedure is followed for the derivatives of uf, and in fact the 
coefficients in the resulting 3 x 3 system are the same. If we redefine 


— ,,f8\o+2e1 — ,f j+2e Bij t2eit2e — 
by =u ý 4 b- =u l 2e + u lj4201-26. } Y — 2, 3, 


then the solution is given by 


due 
are = cb, . 
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For the viscosity and heat conduction coefficients one may take 


~ 


(kj + kj42e1) - 


1 1 
Miter = (Aj t+ Ui+2e)» Kites = 5 


This in essence completes the definition of the discretization of the compress- 
ible Navier-Stokes equations in the interior of the domain. Numerical imple- 
mentation of boundary conditions presents no particular difficulties. Suppose 
for example that Ig is part of a boundary where £t = constant. If OT /ðn is 
given at this boundary, then f sds is given, and nothing needs to be done. 


re 
If T is given at this boundary, then Tj+e, is given, and the integration paths 
are modified by replacing 2e, by eı in the above definitions of integration 
paths. Similar procedures can be followed for the momentum equations. 


Time stepping 


Stability of explicit schemes can only be analized in a approximate maner, in 
a similar way as in Sect. 12.5 for the inviscid case. A representative viscous 
term is added to the model equation (12.35), and stability of the resulting 
scheme is analysed in the same way as in Sect. 12.5. The model equation now 
is a convection-diffusion equation, and methods and results of Chap. 5 can 
be used. 


The remarks made at the end of Chap. 12 about iterative solution methods 
for implicit schemes apply here also. 


13.4 Colocated scheme for the incompressible 
Navier-Stokes equations 


Governing equations 


The governing equations can be written as follows in Cartesian tensor nota- 
tion (cf. (6.1) and (6.14)): 





ui =O, (13.4) 
ĝu” a 
at PF tPa=D, (13.5) 


Foe = uču? — gÊ , oth — v(u’ + uP ) , 


+O 
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Integration over finite volumes 


All unknowns reside in the centers of the cells, as before. Integration of the 
continuity equation (13.4) over the cell 2; depicted in Fig. 11.6 gives 


[ a0 = fenas =0, 


Nj Tj 


with n the unit outer normal. The surface integral is evaluated by approxi- 
mating u® by its value at the center of the cell face. As an example, we take 
the integral over the ‘east’ cell face, i.e. the cell face that separates 9; and 
(2}42e, (in the notation of Chap. 11). This part of T}; is denoted by Ig. We 
get 


f naas = uk fraas = uUBSaE , (13.6) 


TE re 


with the cell face vector sg defined by (11.31). For the ‘west’ face we get, 
taking the sign convection for the cell face vectors s into account, 


U“nadS = —up sow . 


Iw 


The cell face values of u® have to be approximated in terms of surrounding 
cel center values u. We will come back to this later. 


For the momentum equations (13.5) we obtain 


a 
dus 


|2;| J + | Fe? ngds+ | pnadS=0. 
I; L; 





The pressure integral over [g is approximated as follows: 


J pnaas ~ PES , 
Pe 


and pr is evaluated as follows: 


1 
pE = 5 (Pi + Pj+2e,) - 


For the viscous terms we have to approximate integrals, as for example 


J vegngar . 
Iz 
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This is done as follows: 


f vSnsar s ve (us) nse . 
Te 


Derivatives of u% are approximated in terms of surrounding cell center values 
by the path integral method explained in Sect. 13.3. 


The inertia terms are approximated as follows: 


[ wew'ngas = upub sh. (13.7) 


re 


Here ub must be expressed in terms of surrounding cell center values by the 
same method that is to be used for u% in the continuity equation (13.6). 
For u% in (13.7) we can use the usual approximations for convection. For 
example, the second order central scheme is obtained with 
uh = 5 (45 + Uge) 
The first order upwind scheme is obtained with 
up = uj if ufs% >0, 
~ a 


ub Uj42e6, if ub, sf < 0 . 


The «-scheme is obtained with (cf. (9.96)): 


if ul st > 0, and in the opposite case 


1 l-k a 
zi + UF 2e) + = u + 2U5 426, ~~ UF's 4e,) . 


up = 
In the case of a Cartesian grid, this scheme gives rise to spurious checkerboard 
modes, as shown in Sect. 6.3, where various remedies are discussed. The most 
popular of these is the pressure-weighted interpolation method of Rhie and 
Chow (1983), which is discussed in great detail for the case of arbitrary grids 
in Sect. 8.8 of Ferziger and Peri¢ (1996). This implies approximation of uf, in 
terms of surrounding cell center values including pressure values, analogous to 
(6.25). We will not further discuss this method here. Time stepping methods 
have been discussed in Sect. 6.7. At the end of Sect. 6.3, references have been 
given to a large number of publications in which applications of colocated 
schemes to computation of incompressible flows are discussed. 
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13.5 Staggered scheme for the incompressible 
Navier-Stokes equations 


We think that nobody will dispute, that on Cartesian grids, computation of 
incompressible flows is best performed with the staggered scheme proposed 
by Harlow and Welch (1965) and discussed in Chap. 6. In combination with 
the pressure-correction method (Sect. 6.6) an efficient and accurate method 
to compute nonstationary flows is obtained. The physical boundary condi- 
tions suffice. The main advantage over colocated schemes is, that no artificial 
measures are needed to rule out spurious oscillations. 


However, there is no such consensus when the grid is arbitrary. In fact, the use 
of colocated schemes is more prevalent. The reason is that generalization from 
the Cartesian to the general case is relatively straightforward, as seen in the 
preceding section, Staggered schemes are more complicated, and may suffer 
from accuracy problems on nonuniform grids. If the smoothness properties of 
the boundary-fitted coordinate mapping are not carefully taken into account, 
or if too much smoothness is assumed, the accuracy can become bad, even 
on mildly nonuniform grids. This experience has led to a widespread opinion 
that on curvilinear grids staggered schemes are inherently les accurate than 
colocated schemes. But this is not so. 


On colocated grids it is customary to discretize in physical space, as we did 
in the preceding section, and no reference is made to the coordinate map- 
ping, so that its smoothness properties do not come into play, and no serious 
degradation of accuracy is observed as the grid becomes less smooth. Stag- 
gered discretization may also be carried out in physical space; this is done in 
Rosenfeld, Kwak, and Vinokur (1991). But staggered discretization in phys- 
ical space puts a heavy demand on geometric insight and pictorial represen- 
tation, which is why we will develop an algebraic formulation (cf. Wesseling, 
Segal, and Kassels (1999)). Furthermore, we think it desirable to bring out 
explicitly the role of the smoothness properties of the coordinate mapping. 


Geometric quantities and their smoothness properties 


A described in Sect. 11.3, the grid is generated by a piecewise trilinear coor- 
dinate mapping 


æ; = «(€), EN, ECG. 


Fig. 13.1 shows a cell in the physical space 92 and the computational space G. 
This is the same as Fig. 11.6, repeated here for convenience. For mnemonic 
convenience, the cell centers and cell face centers ara indicated by subscripts 
C, N, S, E, W, F, R, with C for center, F for front, R for rear, N for north 
etc. Hence, with the indexing system of Sect. 11.3, 
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l 
Fig. 13.1. Three-dimensional cel. Its image in G is rectangular hexahedron. 


Tc =j, EP = Pj-ez, CE = AXj+e, ; 


for example. With the vertex numbering of Fig. 13.1 we have for example 


8 
1 1 
zo=32 Em , tp = 7 (#1 + eat es + Ts). 


We know from Sect. 11.4 that the points ay,... belong to the doubly-ruled 
cell faces. 


The contravariant and covariant base vectors al®) and @(.) have been defined 
in Sect. 11.5. We repeat for convenience: 


Ow 
Oo” 


It follows that the Cartesian components of these vectors are given by 


aM = VE, aay = 


a Ê 
al) oE B Ox 


B T orb? “(a)~ Ee 
In this section the language of tensor analysis (Sect. 11.5)will be used. There- 
fore the location of an index (subscript or superscript) matters, and summa- 
tion is implied over pairs of Greek indices, one of which must be a superscript 
and one a subscript. 


Because æ = 2(€) is piecewise trilinear, a(.) is piecewise continuous and does 
not exist everywhere. At a cell face £% =constant, aq) is discontinuous and 
will not be used, but ajgy, 8 # a is continuous. For economy we need to ex- 
press Gq) is terms of the cell vertex locations z1, ..., £g in an efficient way. The 
derivatives Ox /O&* are given by (11.25). In C we have (s!, s*, s3) = (0, 0, 0); 
in N we have (s!,s*,s°) = (0,0,1) etc. After some algebra we find from 
(11.25), (11.26), (11.19) and (11.21) 


aac = (te —ew)/Aé', age = (wR - wp)/AE?, 

aje = (æn -@s)/AẸ* , 
aw = zag (t26 — @15), O(3)w = gags (ase — %12), 
QajyrFo = zagt (%48 — T15) , a3)F = sags (t58 — 814) , 
aas = gag (®34— 812), ajs = zapr (®23— T14) 
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etc., where we have used the notation a, = @m+a2,,. For the contravariant 
base vectors we have 

with d; the Kronecker delta. Solving gives (cf. (11.53)): 


1 
(a) li 
a'™ = — (ai) X ayy), a, 8,7 cyclic, 


where ,/g is given by 
VG = Aa) * (ap) X ajy), &, 8,7 cyclic. 


From the smoothness properties of a.) it follows that /g and al) are dis- 
continuous at cell faces. But sgal turns out to be continuous at cell faces 
of the type €% = constant. After some algebra we find that in the cell face 
centers where ,/ga® is continuous, galo) equals the corresponding cell face 
vector times a scaling factor: 


(VIa we = apapswes (vaa ar = aptagsSrr , 


(,/ga')) v, 5 (13.8) 


AEA SN,S * 


Staggered representation of the velocity field 


We want to generalize the classical staggered Marker-and-Cell (MAC) scheme 
proposed in Harlow and Welch (1965) from Cartesian to general coordinates. 
This means that we wish to compute the pressure in cell centers, and nor- 
mal velocity components in cell face centers. It follows from (13.8) that the 
contravariant base vector a(®) is perpendicular to faces of the type ° = 


constant, so at first sight the contravariant velocity components U%, defined 
as (cf. (11.46)) 


U%=u-al 


would seem to be suitable for representing the velocity field u. But we just 
saw that a‘) is discontinuous at cell faces. As a consequence, the use of U* 
leads to bad accuracy on rough grids. But, as we saw, gale) is continuous at 
cell faces where £% = constant. Therefore the following coordinate-invariant 
staggered representation of the velocity field u will be employed (from now 
on denoting æg by @;4.¢,, etc.): 


pea = (/gal) *U)j4e, (no summation). 


From (13.8) it follows that we may also write 
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Viren = (5 U)jtea /AEP AET, 0,8, cyclic. 


This shows that V* Ag? AE? approximates the volume flux through the cell 
face. Therefore V*% will be called the volume flux components. Equation 
(11.31) gives us an efficient formula for ,/ga‘), for example (cf. Fig. 13.1): 


(Ja) 546, = $(w7 — w4) x (wg — x3)/AẸ AE . 


We will need to aproximate V® and u not only in the cell face centers, but in 
other points as well. The general relation between V® and u is (using (11.46), 
(11.47)): 


Ve = yga u, w= ajo V/a. (13.9) 


Finding u requires evaluation of V% in points other than its proper grid 
nodes. Because of lack of smoothness of the geometric quantities, evaluation 
of V“ in other points than its proper grid nodes, and evaluation of u needs to 
be done carefully. A certain tedium is unavoidable. We impose the following 
accuracy requirement on formulas for defining V® in other points than its 
proper grid nodes Œj+e,: constant velocity fields u must be invariant under 
transformation to V®“-representation and back. More precisely, if u is given, 
and V2... 1s computed with (13.9), and V° is determined in another point 
by some interpolation recipe, and u is computed in this point with (13.9), 
then the original u is recovered exactly in the special case that u = constant. 
Furthermore, in the case of the identity mapping æ = €, we will have mul- 
tilinear interpolation. We have found these requirements to be essential for 
maintaining accuracy on rough grids. 


In the cell centers æ; we define (no summation): 
ve = 
(Vga); = 


(V e, + Vien) > (13.10) 
(gad -ea t + (vaal) tea}, (13.11) 


(a) = 4 YI vv Nn p, y cyclic. (13.12) 
(1) . (2) (3)) f°? 
V9 yga = x J/ga')) ; 


We show that the above invariance requirement is met. Suppose u = constant. 


Then (13.10), (13.11) and (13.12) give 


Ve = (/ga'), u, 
Recomputing uj from V? gives, using (13.9) and (13.12): 


| 


uj = (Fate) j(/ga (a) (13.13) 


(cf. Exercise 13.5.1), which we wanted to show. 


542 13. Navier-Stokes equations in general domains 


In the cell face centers 4;4-, we define for 8 # a: 


= (Vf es + Ve. at VP 2eates + Vi 20,—e,) ’ (13.14) 
(Via) jpe, =F {VIa jpeg + (V9a") jeg 

+ (Vga )jtzeates + (V9a"™)i420,-e0} 

(no summation), and (@(a)/y/9)j+ea is related to (/ga)) 46, by replacing 


j by j + €a in (13.12). In a similar way as before it is easily seen that the 
invariance requirement is satisfied. 


(13.15) 


In the cell edge centers @j4¢,+4¢,, 8 # @ we define (no summation): 


teates = IVen + F peat2eg) ] 
(/9a') jreatep = Ft( (vga) )j+ea + (gal) )jpeatzea} , (13.16) 
Viteatep = (Vite, + Vir zeates) > l 
(JGa) ectes = H(I) jpe + (Vga )jpzeates} » 


and for y # a, 8 


V heates =a Vice, + Vihe, t Vj e 42ea + Vite toca + Vie 420, 
+ Vie, t2eg t Vj-e,42eat2es T Vise paare) ; 

(vaa) )jpeates = =ł vga”) )j- ey + (vga jte, + (yga) )j-e,+2ea 

+ (fga) 46,420, + (ga) j-c,426, 

+ (fa) jpe 426, 

+ (Vga); 42e,42¢9 + (9a) j4e,420n420 0} 5 


and (a(4)/V9)j+eates iS related to (/ga) peate, by replacing j by 
j+eateg in (13.12). Again, it is easily seen that the invariance requirement 
is satisfied. 


Furthermore, u is required at the cell vertices #;4¢,4¢,+e,- Lhe way to pro- 
ceed is clear from the preceding cases. We define (a Æ 8, a Æ y, B # y, no 
summation): 


Viterteates 54 (Viren + jteat2e, + Viteat2e, + Vij ea tleptie, E 
(/gal* );tertes = =; (,/ga'*)) Jitea + + (vgal® ) jteat2eg 
+ (gal) \iteat2e, 


+ (J/ga') sre, 42¢94+26, } ; 


and (@(.)/./g) is related to (./gal) IN ®j4e,te,¢e, by (13.12) with appro- 
priate grid point indices. 
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Coordinate invariant discretization of the incompressible Navier- 
Stokes equations 


Finite volume integration of the continuity equation div u = 0 over 92; gives 


a= (13.17) 


3 
YT AEF AETV TE" a, 8,7 cyclic. 
a=l 


3 . 
0 = f divudN = f undr = Y (a-s) J+ea 
2; I; 


J— Ea)? 


For robustness, the discretization scheme should be coordinate invariant. This 
is the case for (13.17), because it contains a contravariant representation of 
the velocity field. 


A straightforward way to proceed would be to discretize the momentum equa- 
tions written in coordinate invariant form, using tensor analysis. However, 
in this formulation the so-called Christoffel symbols occur, encountered in 
Sect. 11.5. These involve second derivatives of the mapping æ = æ(Ẹ). Be- 
cause the mapping is piecewise bilinear, the Christoffel symbols are ‘infinite’ 
at cell edges. Approximation of the Christoffel symbols by straightforward fi- 
nite differences gives reasonable results on smooth grids only. This is perhaps 
what has led to a widespread belief that staggered discretization is inaccurate 
in general coordinates. In order to avoid this difficulty with the Christoffel 
symbols we first transform only the independent variables, obtaining a form 
that is not coordinate invariant, that is discretized and used as a stepping 
stone to arrive at a coordinate invariant discretization. 


The derivative of some quantity y transforms according to 


99 _ (8) E 
apa = Ae JEF ' (13.18) 


Using the identity (cf. Exercise 11.5.19) 


ð 
— (a)) — 


this can be rewritten as 








Op L Sah 
ðr Vg 3EL (Jgag p) . (13.19) 
The momentum equations can be written as 
ðu ÖÖ, dvel) 
Or + Dre (u%u) = -Vp + ane’ (13.20) 


where 
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ðu! ðu? 


das + Ox} 
ðu? ðu? 


Jra t pe? 
ĝu? ĝu? 


ðr” + ðr? 








e(2) 











By applying (13.19), equation (13.20) can be rewritten as 


(uyga ()) = 0. 
(13.21) 


Ou 


N(u,p) = = J + aE — (u V°) + Vp- 


FE 


This equation is integrated over the shifted finite volume 92;}e, depicted in 
Fig. 13.2. Treating each term successively, we obtain 


Tee ee 





Fig. 18.2. Shifted finite volume 197;+.-, 


| Faa = e ldujge,/dt, (13.22) 


Niper 


Ê uve)d dR = f jaa (uVe )ag' dede". 


— 


g 0° 
Nite Gite, 
We have 
5 Eja +1/2 ja +1/2 
Ii = f ger (YV de dg de® = / (uV!) iit ae? de? , 
Gite, Coa 1/2 Eja -1/2 


where the fact that uV! is continuous in Gj4e, has been used. This integral 
is approximated by the midpoint rule: 
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Ty = AE AE (uv) 27 (13.23) 
This is approximated further by using 


l~ 1 1 
V; = (Vie, + Vie) ° 


which contains V! in its proper nodes. If a second order central scheme is to 
be used, then u is expressed in terms of V“ in the manner described before. 
If a first order upwind scheme is to be used, then we write 


Uj =Uj-e, if V;>0, Uj = Uje, If Vi <0. 


Similarly, 


Ias f gaV EE = Ae AP (UV) LE 


Gite, 


This is further approximated using 


2 m~ I 2 2 
Vitertes = Vite, + Vi42e,t09) ’ 


and u is handled in a similar way as before, using a shift in the £?-direction 
in the upwind case. Analogously, 


I= / sea (UV )detdetdgs = AE AE? (uy?) ite tes (13.24) 


jt+ei1—es ? 


Gi+ey 


3 = l(y3 3 
Vitertes — z (Vite, + Vit2e tes) ’ 


and w is handled as above. Integration of the pressure term is done as follows: 


y= J VpadN = Vpjpei| Ripe], 
Nite: 


where we have used smoothness of Vp. The term Vp;+e, is expressed in terms 
of surrounding nodal values using the path integral method (Sect. 13.3). We 
write, using smoothness of Vp, 


© y4+2e) 


bi = pr = J Vp: dæ =Vpjte. tca) ca = elit". (13.25) 
T 
Similarly, 
Bip2eg  Pj+2e]+2e2 
b = pita + pitate = + Vp: dæ = Vpj+e, ` C(2) , 


Tj—2e2 ®Œj+2e]—2ez 


546 13. Navier-Stokes equations in general domains 
— +2 j +2e1+2 
C(2) = zhi ze + al oe, 20, . (13.26) 
Using the third direction we obtain similarly 


— _]ĵj+2e jt2eit2es w . 
b3 = Pl} 2e + Pl 420,26 = VPj+e, ' C3), 


— p42 +2 2 
ca) = ahta tehte te (13.27) 
We now have three equations expressing Vp in terms of surrounding nodal 
values, without any assumption about the smoothness of the coordinate map- 
ping. The system is solved for Vp by defining 


cf) = Cig) X ci) C, C= C1) ° (c(2) x c(3)) . (13.28) 
Then the solution of the above three equations for Vp is 


op 


ar. = cl) b, . 


Integration of the viscous term over §2;4¢, gives three contributions. The first 
is 


Is = f gg (Vag vel) )ag*de?ags 
Gite, 
Eja+172 Éja+1/2 


(Vaal, ve jit dede , 


Ea -1/2 Eja —1/2 
where aag is constant. We write 


Ii; S AE? AE? (yga vel) ) j+ . 
The second contribution is 
Eji 41 Eja+1/2 
fig= f f (viag ve) t zdele , 
Ei $js-1/2 


where sgal?) is piecewise constant. We make the following approximation: 


Iis S AB AE (yga? ve) itet 


+e eR ? 


where we define 


(Vga )jteites = Haa) jte, + (J/ga)) 5s 26:40} , 
The third contribution is i7, which is handled just like T46: 
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1 2 
j+ Cio ti/2 


I7= f f (vaag vel®)) et] de tde? 


¿l 2 
J1 jg—1f2 


AEAEE? (yga velite tes 


j+ei—es t 


He 


where 


(/ga)) i 4e:te, = 3 (/gal)) s+, + (Jga"™) :4 20,405} ' 


In Iys, Iis and Iy7, e) has to be approximated, requiring discretizations 
of derivatives of u. We start with Tı5 and write, using (13.18): 


ne = {a a . 
dx? j, P gge j 


In a cell-center we are not hampered by non-smoothness of the mapping 
x = #(€), and the following straightforward approximation can be used: 


du\ wml (ta) | lites 
(aa), * Dae (2; J ejts i 


The same procedure cannot be followed for Jis and F17, because we are at cell 
edges, where the geometric quantities are discontinuous. Instead, we proceed 
in a similar way as for Vp. An approximation for ĝu? /ðxf in jpe, +ez, for 
example, is derived by writing (no summation): 





Titer teztey 


—_ ajiterteate, _ a ~ a 
ba =" jterter—e, ~ Vu” -de= Vuj4eitez Ciy): 
(13.29) 
Piper tez ey 
— jo teitez+ey L 
C(x) Ea|jpeiter-e, y= 1,2,3 . 


This system of three equations for Vu® is solved in similar manner as the 
system for Vp. We define e'*) by (13.28), taking C(q) from (13.29), and find 


ĝu” 

(F7 itertea S cb, . 
In cf) values of u in cell face centers, cell edge centers and cell vertices 
occur; these are expanded in terms of V* by means of (13.9), (13.12) and 


(13.14)—(13.16). 


The momentum equation (13.21) is integrated also over shifted finite volumes 
2j4e, and 2;4.,, and discretized in a completely similar manner. We may 
consider the resulting finite volume discretization of (13.21), briefly denoted 
as 
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J N(u,p)dQ = 0 (13.30) 


Rite 


with the integral approximated in the way indicated above, as semi-discretized 
evolution equations for u at the cell face centers (by semi-discretization we 
mean discretization in space but not in time). This is not what we want, 
because we wish to generalize the staggered scheme of Harlow and Welch 
(1965) from Cartesian to general grids, which implies that we want evolu- 
tion equations for the volume flux components at the cell face centers. Using 
(13.9), this is achieved by replacing (13.30) by 


(vga) igen . / N(u,p)d2=0 (no summation), 
Ritea 


with the integral approximated in the way described above. This gives for 
the various terms in N (u, p) the following results. From (13.22) and (13.9) it 
follows that 


(vga) 16, + "ugg = |Qj4¢,|dV,.,/dt (no summation) , 


Qjtea 


which is precisely what we want. Furthermore, using (13.23), 


1 } ey +e 
(J/ga™) ;4e, In = FAPA (Yaa) j40, (uje VRE — uj V REG) 
(13.31) 


{no summation). Here u is expressed in terms of V®“ in the way described 
after equation (13.23). The remaining terms are handled in a completely sim- 
ilar manner. A similar method to avoid Christoffel symbols is followed in He 
and Salcudean (1994), Karki and Patankar (1988), Melaaen (1992), Wessel- 
ing, Segal, Kassels, and Bijl (1998), Wesseling, Segal, and Kassels (1999). 


In this way a coordinate invariant discretization is obtained, that in the case 
of the identity mapping æ = € reduces to the classical Cartesian staggered dis- 
cretization of Harlow and Welch (1965). Furthermore, the discretization error 
is zero for u and Vp constant, regardless of roughness and non-orthogonality 
of the grid. It would be too tedious to show this here, but verification by 
numerical experiment is easy. 


One may wonder how it is possible that a coordinate invariant discretization 
has been obtained without encountering Christoffel symbols. This may, to a 
certain extent, be elucidated as follows. The Christoffel symbols are defined 


by 


a, — (a), PA) 
{gp SO ee (13.32) 
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In (13.31), for example, if u is expressed in terms of V%, products of a!) in 
one point and a(s) in neighbouring points are hidden, similar to what one 
would get by discretizing (13.32) (assuming differentiability of a;g)). Another 
way of looking at our circumvention of Christoffel symbols is to note that 
finite volume integration precedes transformation to invariant form, so that 
integrals of the Christoffel symbols are required, removing the derivative in 


(13.32). 


The structure of the stencil 


Fig. 13.3 shows at which points u, V“ and p are used in the discretization of 
the inertia and pressure terms in the equation for V'; values of V® needed to 
express u are included. For clarity, we show only the two-dimensional case. 
Fig. 13.4 shows the points where e'*), u and V® occur in the viscous terms. 
The u points are those needed to express e‘*) in terms of u with the path 





Fig. 13.4. Stencil of viscous term: O : ee: u, |: V!, — : V?. 


integral method, and the V® points are those needed to express u. Note that 
the viscous term contains 9 V! points and 12 V? points; this does not seem 
exorbitant in view of the fact, that mixed derivatives have to be approximated 
in the case of non-orthogonal grids. 
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Boundary conditions 


Numerical implementation of boundary conditions has been discussed for 
Cartesian grids in Sect. 6.4. Generalization to general structured grids is 
relatively straightforward. A guiding principle is that the numerical boundary 
conditions of Sect. 6.4 should be recovered in the special case of a Cartesian 


grid. 


No-slip condition 


Let at the ‘west’ face of the domain £t = gl h the following boundary condi- 
tion be given: 

u(t)j-e, = Volt, &j~e,) 5 j= (1, j2, js) » 1 >= (3,0, 0) . 
Then 


Vine, = —(/ga"”) ° Vb) j—ey 


is substituted in the continuity equation (13.17). If the normal velocity is pre- 
scribed all along the boundary, the three-dimensional version of the discrete 
compatibility condition (6.53) must be satisfied: 


D eav + OD AB APY IE + SO AAP VAY = 0, 
j2 js ja ji ji j 


where E stands for the ‘east’ face of the domain, etc., in the same way as in 
the beginning of this section for cells. 


Because V} .,, j = (1, j2, ja) is given, we do not need to integrate over 2;_.,. 
Where in the integral over 92;4-, velocity values at the boundary occur, these 
are simply replaced by the given boundary values. 


Next, let the no-slip boundary be at the ‘south’ face, i.e. at £3 = Eir Let 
j = (jı, j2, 1). Again, consider the integral over 2;4¢,. 


In i3 we encounter a boundary term 
AETAE*(UV*) i401 -es 
which is known from the boundary condition. 


In Iı4 we cannot evaluate Vp;4-, with the path integral method in the same 
way as in the interior, because this would involve p;_2.,, referring to a grid 
point outside the domain. Therefore the integration path in the £°-direction 
is chosen in a one-sided way. We write 
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— j+2e j+2eit2es wv , 
b3 = pl ° + plte VPj+e1 > €(3) ; 
= jt+2es J+2ei1+2e3 
c3) = æl + Elize, . 


The two other integration paths for the approximation of Vp;,., are chosen 
in the same way as in the interior. 


In i7 we encounter a boundary term 
AGAR ( fgas vel) j42,-c5 - 


Evaluation of ef) by the path integral method in the same way as in the 
interior would lead to the use of grid points outside the domain. We therefore 
again choose a one-sided integration path in the £°-direction. In the case of a 
Cartesian grid also a one-sided approximation is necessary. We write for the 


approximation of VUSte:—e3! 
bg = ufte = Vu? Cc 
3 = j+e1—-e3 — jtei-es ` ©(3) 3 
= j+e1 
c(3) = li te;—es . 


The other two integration paths are chosen as in the interior. 


This covers the principles of the modifications required near a no-slip bound- 
ary. 


Free surface conditions 


We recall from Sect. 6.2 that the free surface conditions are given in Cartesian 
tensor notation by: 


un” = w(t, æ), naosg =0, nao®tg=0, (13.33) 
with n the unit outer normal, and s and t two linearly independent tangent 
vectors in the surface. It is assumed that the grid near the free surface has 
the following local orthogonality property: grid lines that go from the free 
surface into the interior are orthogonal to the free surface. The situation is 
sketched in Fig. 13.5 for the two-dimensional case. If this local orthogonality 
property is not satisfied, numerical implementation of the free surface condi- 
tions becomes complicated. 


Local grid adjustment to satisfy the above local orthogonality property is 
easily done as follows.The (in general non-orthogonal) cell faces on the free 
surface are left unchanged. Take a vertex P on the free surface. The local 
normal direction is taken to be the direction of the average of the four sur- 
rounding cell face vectors. In this direction a line is drawn into the interior. 
Its first intersection with an interior cell face is selected as a new vertex P, 
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Fig. 13.5. Locally orthogonal grid 


which replaces the interior vertex that was originally connected to P. This 
procedure is clarified in Fig. 13.6 for the two-dimensional case. 


Let the free boundary be at the ‘west’ face, i.e. at €1 = Eiz Then 


Fig. 13.6. Local grid adjustment. 


Vle, =~ (Vga v)j-e; j= (1, ji j2) 


is given, so that we do not need to integrate over 2;_.,. 


The pressure integrals are treated in the same way as for the no-slip condition, 
and need not be discussed again. 


From Fig. 13.4 we see that the integral over 92;4}e, requires u;_-,. However, 
if j = (1, j1, j2), @j-e, is at the free surface, and uje, is unknown; only 
the normal component is given. In the special case of a Cartesian grid, the 
normal component of tj—e, is all that is needed. Due to the condition of local 
orthogonality of the grid that we have imposed above, the Cartesian situation 
is approximated to a certain extent, so that the tangential components of 
Uj—e, have little influence. Therefore we venture to express tje, in terms 
of interior values of V% by means of a one-sided evaluation. We write 


Vie = —(l¥ga" |u,)j-e, , 


1 
2 fod 2 2 
Vj e = 5 Vitez + Viez) y 
1 
3 os 3 3 
Vine, = 5 Vites + Vi es) 
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We write, furthermore, 
Ve, = (vga® . u)j-e , (13.34) 


where we define (no summation) 


(VIa) e, = FAVT) jpe + (V9a)j-e}s a523. 
Using the dual vectors &(a) defined by 
üla) + gal”) = 85, 
equation (13.34) is solved by 
Uj—e, = (@(ayV);~e, - 


The dual vectors &{a) follow from 


- gal) x ga 
Aa) = 0). gal) (3) ’ 
„ga fga x yga 


In this way u;_¢, is constructed using both information from the boundary 
conditions and from the interior. 


a, B, y cyclic. 


Next, let the free surface be at the ‘south’ face, i.e. at £° = &1/. Now Vie, 
is given by the free surface condition: 


Vi og = —(VGa|v)j-es j= (ija, 1). 


Consider the integral over 92;4¢,. In the integral T13, given by (13.24), we 
encounter a boundary term (wV*),;4.,-2,. We make the following approxi- 
mation: 


1 
Viter—es = z Vj- e3 + Vi 2, — es) : 


and U;4e,-e, 1s evaluated in similar one-sided fashion as before, using 


1 ty 1 
Vi = Vitex ’ 


R 


2 2 2 
Vite: -e3 = zí jpez t Vj-e a + Vprscrtes + Vte- ea) 


We write 
Vitees = (vga . u)jtei-es 
(Vga )jpeize3 = (9a Jigen , 
(VIa) pezes = FAVT) jpe, + (VIA) ;-0, 
+(/9a")) 5426, 465 + (/ga')) 5 420,-ea} . 


Pamas 
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We obtain 
Uj+ei—es = (G(a)V")j4e1—e5 ’ 
with the dual vectors &{a) related to gal as before. 


It remains to consider the integral over the viscous term. The evaluation of 
elf) in Iis and Fie requires u at the boundary, which is approximated in a 
similar way as above. In I,7 the boundary term 


At! At? (/ga ve) 546, —e3 


occurs. This term is not zero in general, but nevertheless it can be neglected. 
The reason is that what is really needed is not Iy7 but (/ga"); 46, > Liz. 
Taking the inner product of the above boundary term with gal”, we get 


Bı; = AP AR Jga) jype (gal ve”) gees 
= AE AE (yga) 46, (ga PT) peres , 


with o7 the Cartesian shear stress components. Per definition, a3) is in the 
direction of —n (with n the outward normal) and because of the local orthog- 
onality property of the grid that we have postulated, a) is in a direction s 
tangential to the free surface. It follows that we can write 


Biz = AEL AE? yga | 54.6, | /gal?) liter—es(8yM 0") j4e,-e5 = 0, 


according to the free surface boundary condition. So the boundary term can 
be neglected. This concludes our discussion of the numerical implementation 
of the boundary conditions (13.33). 


If we have a truly free surface, it moves with the flow, and the normal velocity 
cemponent is unknown. All three stress components are prescribed: 


P—NoaapNB = Po, NaFagsg=0, NaCagig = 0, (13.35) 
where we have used Cartesian tensor notation. 


Let the free surface be at the ‘west’ face of the domain, i.e. at ¿£t = Eiz 
Since the normal velocity at the free surface is unknown, we have to compute 
Ves j = (1, j2, ja). We therefore integrate over the cell 2;_.,, which has 
half the normal size, cf. Fig. 13.7. The various integrals that we encountered 
in the discretization in the interior can now be approximated as follows. 


First of all we have 


Thy = ACP AE (uv?) 


j=ei' 
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Fig. 13.7. The cel 2j;~e,, 3 = (1, j2, Js) 


When the inner product with (yga )j-e is taken, uj-e, is replaced by 
Vie e1, Which is the unknown for which the integral over je, provides an 
equation, so no further measures are required. 


Next, we write 
~ Aclag3 2 2 
EÇ = AE AE (Uj-ei+es Viten — Uj-e,-e2 Vie) ' 


where u will be replaced by V! when the inner product with (/ga)) se, is 
taken, and no further modifications are necessary. [3 is handled similarly. 


In order to take advantage of the fact that the total stress is prescribed on 
the boundary, we lump the pressure and viscous terms together, and write, 
in Cartesian tensor notation, 


ih 


& 


J (P,a — Tap, pase 


fire, 
Using (13.19) this is rewritten as 
ð 
I, = J gavi p- Vaa oap)dé dE? dg? 
Gj-e, 


This gives several cell face integrals. To begin with, 
B= Aeae3( gap — yga oap) ihe, - 


The inner product with (,/ga"));_., is required, which is in the direction of 
—n, with n the unit outer normal, so that ,/ga\) = —|,/ga") |n. Therefore 
we can write, at # = @j_.¢,: 
1 1 
Jaga) (/ga)p — yaah ong) = 


= | J/ga |? (p — nacagng) = Vga? Pa , 


which takes care of the contribution at æ = #;_.,. For the contribution at 
æ; we can let p; stand, and the derivatives of u in ogg are evaluated with 
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the path integral method as usual, but expressing u;_-, in terms of V“ with 


: ; : 1 2 3 
a one-sided bias, as before, using Vie Vite» Vjte,- Next, we encounter 


1 j —e1 +e? 
IY? = SAC AE (Vga g p — vaa oo) atg ' 


Because of the local orthogonality condition of the grid, a‘?) is perpendic- 
ular to a") at the boundary. Since the inner product with (/ga)); 2. & 
(f/ga")); ete, is required, we may just put p = 0 in J}*. Furthermore, 
again because of the orthogonality condition, (/gal?)); ete, is in a direc- 
tion tangential to the free surface, so that we may write 


2 
(Jga\))) 6, (Yaa oag)j—ester 
= —|fga|;_ e | /ga")|;_ te, (Natap8p)j—e,t0>) = 0 ’ 


where s is some vector tangential to the surface. Similarly, the contribution 
in the third direction 713 = 0. This completes our discussion of the finite 
volume discretization for the cell 2;_., of Fig. 13.7 with boundary conditions 
(13.35). 


Next, we have to consider the integral over §2;4¢, in the case where the 
boundary conditions (13.35) are applied at the ‘south’ face. This can be done 
in identical fashion as described for the boundary conditions (13.33), with 
one small difference: Ve e, is not replaced by a known boundary value, but 
is left to stand, because now it belongs to the set of unknowns. 


Outflow conditions 


The purpose of outflow conditions is to let the fluid flow out of the domain 
while disturbing it as little as possible. For reasons given in Sect. 6.2, pre- 
scribing the free surface conditions (13.35) is a good option. 


Solution methods 


The structure of the system of ordinary differential equations that results 
from the spatial discretization described above is the same as in the case of 
a Cartesian grid as presented in Sect. 6.4, and is given by equation (6.42). 
Temporal discretization can be done in the same way as in Sect. 6.6, using 
the pressure-correction method. For the iterative solution of the pressure- 
correction equation and of algebraic systems resulting from implicit time 
stepping schemes the same methods can be used as for the Cartesian case, 
presented in Chap. 7. 
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Concluding remarks 


We have given a description of a coordinate-invariant spatial discretization 
using a staggered scheme on a boundary-fitted grid. When the grid is Carte- 
sian, the classical scheme of Harlow and Welch (1965) is recovered. As will 
be illustrated by an example in the next section, accuracy is maintained on 
nonsmooth grids. To achieve this we have carefully taken into account the 
smoothness proporties of the boundary-fitted coordinate mapping, avoided 
the use of Christoffel symbols in the way described above, and by implement- 
ing transformations from V% to u and vice-versa in a special way. 


What is the price to pay for the above properties? In a comparitive study, 
in Manhart et al. (1998) it is found that a three-dimensional time-implicit 
implementation of the method described here uses 243 words per cell com- 
puter memory. A time-explicit Cartesian staggered code is found to require 
only 12 words per cell, whereas a colocated code on a curvilinear grid using a 
time stepping scheme of IMEX type (Sect. 5.8) is found to require about 200 
words per node. So a staggered scheme does not require much more memory 
than a colocated scheme. 


Exercise 13.5.1. Prove (13.13). Hint: note that (Fz0(a))3 has been defined 


such that (aq)); - (a®)); = 68. Take the inner product of (13.13) with 


(yga). 


13.6 An application 


In order to illustrate that the general coordinates staggered discretization 
described before is at least as accurate as the discretization methods that are 
mostly used at present in codes to compute Navier-Stokes solutions in com- 
plicated domains, namely colocated finite volume methods using pressure- 
weighted interpolation (Rhie and Chow (1983)) and finite element methods, 
we approximate a simple exact solution on a rough grid. To make our point, 
it suffices to consider a simple two-dimensional example. The exact solution 
is Poiseuille flow: 


ul'=a2?(l—2*), u®=0, p= —2Re'x!, Re=1. 


The grid, shown in Fig. 13.8, is chosen rough deliberately. Figs. 13.9, 13.10 
and 13.11 give streamlines and isobars for the staggered discretization, a fi- 
nite element code using Qı — Po elements (quadrilaterals with bilinear basis 
functions for the velocity and constant basis functions for the pressure) and 
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Fig. 13.8. Grid for Poiseuille flow. 


a commercial code! using colocated discretization with pressure-weighted in- 
terpolation. Fig. 13.12 shows an even wilder grid, meant to investigate the 


Fig. 13.9. Streamlines and isobars for staggered discretization. 


effect of sudden refinement and derefinement. Results are shown in Figs. 13.13 
and 13.14. We do not have results for the colocated code for this case. The 
streamlines should be straight and the isobars straight and equally spaced. 
Clearly, the staggered discretization is more accurate than the other two 
methods. This illustrates that staggered schemes are not inherently inaccu- 
rate on general grids. On the contrary, they can be quite accurate, provided 
the smoothness properties of the boundary-fitted coordinate mapping are 
carefully taken into account. This can be done in the way described above. 


! Results kindly provided by Mr. R. Agtersloot (Delft Hydraulics). 
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Fig. 13.10. Streamlines and isobars for finite element method. 


13.7 Verification and validation 


After results have been obtained with a computational fluid dynamics com- 
puter code, the credibility of these results needs to be assessed. Two types of 
errors may be distinguished: modeling errors and numerical errors. Modeling 
errors arise from not solving the right equations. Numerical errors arise from 
not solving the equations right. The assessment of modeling errors is called 
validation, whereas the assessment of numerical errors is called verification. 


Validation 


For simulation, a conceptual model of physical reality is built. Exactly what 
physical reality is turns out to be a surprisingly difficult question, that lacks 
a unanimous answer, and is at the core of the philosophical discipline of epis- 
tomology, and seems to transcend scientifc discourse. While justly wondering 
at what has been called the unreasonable effectiveness of mathematics in de- 
scribing physical phenomena, we put our trust in the apparently immutable 
regularities and laws of nature, that exist independently from what science 
makes of this. In short, we accept physical reality as given. Conceptual mod- 
els in fluid dynamics are mathematical models consisting of equations. The 
modeling error is the discrepancy between physical reality and the math- 
ematical model. Modeling errors may arise from simplifications introduced 
in the Navier-Stokes equations, such as neglecting viscous terms, or making 
thin-layer approximations, or assuming two-dimensionality or some form of 
symmetry, or, most importantly of course, the use of a turbulence model. For 
validation, comparison with physical experiment is necessary, or comparison 
with computed results obtained with mathematical models involving fewer 
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Fig. 13.11. Streamlines and isobars for colocated discretization. 





Fig. 13.12. Grid for Poiseuille flow. 


assumptions, such as direct numerical simulation. Much experimental and 
numerical information for flows of varying degrees of complexity is available 
in databases that can be accessed via Internet. Good points of entry to this 
information are the following websites: 

www.cfd-online.com 

www. princeton. edu/*gasdyn/Fluid_DynLinkd/data.html 
and the ERCOFTAC (European Research Community on Flow, Turbulence 
and Combustion) website: 

imhefwww.epfl.ch/ERCOFTAC 
Of course, not only numerical but also experimental results are contaminated 
by errors. Validation makes sense only after verification, otherwise agreement 
between measured and computed results may well be fortuitous. 
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Fig. 13.13. Streamlines and isobars for staggered discretization. 


Ley 


Fig. 13.14. Streamlines and isobars for finite element method. 


Benchmark problems 


For verification, comparison with results obtained with other computer codes 
for the same mathematical model is very valuable. Extensively studied flow 
cases using a well-defined mathematical model for which reliable numeri- 
cal results are available are called benchmark problems. In benchmarking, 
results that are known to be correct should be reproduced. This helps to 
eliminate coding errors. Benchmarking is also useful for showing progress, 
such as enhanced efficiency, with respect to older methods. We will now give 
some pointers to the literature concerning widely used benchmark problems. 
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A benchmark for which an analytic solution is available has been discussed 
in Sect. 13.6. For more realistic cases, exact solutions are not available. A 
benchmark that has been studied extensively is the case of the backward 
facing step, for which some results were shown in Sect. 6.5. These results are 
not of benchmark quality, but are meant to illustrate the influence of outflow 
boundary conditions. Good results may be found in Morgan, Périaux, and 
Thomasset (1984), Kim and Moin (1985), Gartling (1990), Sani and Gresho 
(1994), Freitas (1995), where the two-dimensional flow over a backward fac- 
ing step is discussed extensively, and much numerical information about the 
solution is gathered. Physical experiments provide only limited information 
about the exact two-dimensional solution for the backward facing step, be- 
cause physical experiments are necessarily three-dimensional, and it seems 
that in the three-dimensional case a closed separation bubble does not occur 
(Wesseling et al. (1994)). Closed separation regions do not normally exist 
in three dimensions. In fact, separation is hard to define in three dimen- 
sions. In two-dimensional computations, the length of the bubble increases 
significantly with the Reynolds number. Consequently, if numerical diffusion 
generated by the discretization of the inertia terms is non-negligible (effec- 
tively decreasing the Reynolds number), the computed separation bubble 
will be too short. The first order upwind scheme is found to be wanting. This 
benchmark shows up the need for higher order discretization of the inertia 
terms, perhaps using a slope limited scheme (Sect. 4.8) to rule out spurious 
wiggles. This benchmark can be used to test codes on single block and multi- 
block Cartesian grids (Fig. 13.15), and on curvilinear boundary-fitted grids 
(Fig. 6.3). 


Another frequently used benchmark is the flow in a closed cavity. This case 
can be handled already by simple codes, because only the simplest type of 





Fig. 13.15. Single block and multi-block Cartesian grids for flow over a backward 
facing step. 


boundary condition occurs, namely the no-slip condition, and if the cavity 
is rectangular Cartesian grids suffice. For these reasons, this benchmark has 
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been around a long time. Flow may be induced by having one of the walls 
move (lid-driven cavity) or by applying temperature differences to the walls 
(buoyancy driven cavity). In the latter case the Boussinesq equations are 
solved instead of the Navier-Stokes equations. For the lid-driven cavity, re- 
sults of benchmark quality are given in Ghia, Ghia, and Shin (1982). Some 
publications in which this test case is discussed are Armaly, Durst, Pereira, 
and Schonung (1983), Burggraf (1966), Leschziner (1980), Kim and Moin 
(1985), Shyy, Thakur, and Wright (1992), Zang, Street, and Koseff (1994). 
Benchmark quality results for buoyancy-driven cavity flow are given in de 
Vahl Davis (1983), Hortman, Perić, and Scheuerer (1990). 


If instead of being rectangular the shape of the cavity is skewed or otherwise 
distorted, a test case for codes employing non-orthogonal grids is obtained. 
Examples may be found in Demirdžić, Lilek, and Perić (1992), Oosterlee ef al. 
(1993). 


A third benchmark is the unconfined flow past a circular cylinder. At 
Reynolds numbers above about 40 the flow becomes unsteady, and periodic 
vortex shedding occurs, giving rise to a so-called Karman vortex street. So this 
test case is suitable to test both spatial and temporal discretization schemes. 
Results are given in Engelman and Jamnia (1990), Rosenfeld, Kwak, and Vi- 
nokur (1991), Marx (1994), Freitas (1995); further references may be found 
in these publications. 


Verification 


Verification is the process by which the degree of numerical accuracy is as- 
sessed. For benchmark problems checking numerical accuracy is relatively 
easy, because accurate results are available to compare with. But when a 
CFD code is used for the purpose for which it is designed, namely to predict 
flow properties that are a priori unknown, there is nothing to compare with. 
Nevertheless, numerical error assessment is possible to some extent, and we 
will briefly discuss this topic below. 


Numerical errors may be divided into three kinds: the (global) discretization 
error, the iterative convergence error and the rounding error. The rounding 
error is due to the finite precision arithmetic of computers, and is in CFD 
applications usually negligible compared to the two other sources of numer- 
ical error. Therefore the rounding error will not be further discussed. The 
discretization error is due to the replacement of the differential equations by 
an algebraic system (discretization). The iterative convergence error arises 
from inexact solution of the algebraic system by some iteration method. 


It was shown in Sect. 7.2 for the simple case of stationary iterative methods 
for linear problems, that the difference between two successive iterates does 
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not give much indication about the size of the iterative convergence error. 
This holds a forteriori in more complicated situations. A better indication 
of the error can be got from looking at the residual. Of course, the relation 
between the residual and the error is complicated; for example, for linear sys- 
tems the condition number, which is usually not known, enters into play. But 
often an interpretation of the residual can be given which provides a guideline 
as to how small it should be made. For instance, when iteration stops it is 
known that one has solved exactly a system in which the right-hand side is 
perturbed by the residual, or, in a time stepping method, the residual can be 
interpreted as a perturbation of the solution at the preceding time level. 


If the discretization scheme is sound, the discretization error can be made 
smaller by decreasing the time step and by refining the spatial grid. The 
question is how fine the grid should be. In principle this question can be 
answered by comparing numerical solutions at different levels of refinement. 
If the difference between numerical solutions at two different levels of refine- 
ment is negligible from an engineering point of view, the numerical solution 
obtained may be said to be grid-converged, and the error is dominated by 
the modeling error. On modern computers the available memory is frequently 
large enough to allow sufficient refinement in two dimensions, but in three- 
dimensional applications it may easily happen that the grid is not sufficiently 
fine. The least one can do in this situation is to give an indication of the size 
of the error, by showing the discrepancy between numerical solutions at two 
refinement levels. Furthermore, one may try to estimate the zero mesh size 
limit of the solution (i.e. the grid-converged numerical solution) by Richard- 
son extrapolation, which we will now briefly discuss. 


Richardson extrapolation 


The method is named after Richardson (1910), but the idea behind it is 
much older, and was already used by Archimedes (250 BC) in his method to 
approximate the value of 7. 


Let us consider a stationary problem discretized on a boundary-fitted grid. 
In €-space the grid is uniform. Assume that the mesh size is A“ = cah 
in the £%-direction, with c, constant, and h a parameter that governs the 
degree of refinement. Let æ be a grid point belonging to all grids beyond a 
certain refinement level; many such points exist if the refinement parameter 
h is chosen as follows: 


h € H{1,1/2,1/4,..}. 


Let y stand for one of the unknowns, for example a velocity component or 
the pressure. The global discretization error is defined as €, = pp — p, with 
pn the numerical solution (computed exactly), and » the exact solution. It 
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is assumed that €} has the following asymptotic expansion as A | 0: 
€n(@) = e(w)h? + of h?) (13.36) 


for some known exponent p; how to find p will be discussed later. Although 
the function e(«) is of course not known, we can use (13.36) to extrapolate 
the numerical solution to A = 0, by using the numerical solution at two 
refinement level in the following way. Writing 


Po hlo Pas 


we have 
Ph = Po + ehP 3 
Pri2 = ~ote(h/2)? . 
Elimination of e results in 
mw P Ph/2 — Ph 
70 op] 


It remains to determine p. The existence of the asymptotic expansion (13.36) 
has been rigorously proved and p is known a priori only for simple cases. 
If the exact solution is a sufficient number of times differentiable, and the 
numerical solution shows no trace of spurious wiggles, and the numerical 
boundary conditions have been implemented in a suitable way, then p equals 
the formal order of accuracy of the scheme. For example, for the convection- 
diffusion equation with a second order central scheme for the convection term 
we have p = 2. If the first order upwind scheme is used for the convection 
term then p = 1. But in realistic applications with complicated geometries, 
the exact solution may not be sufficiently differentiable near corners of the 
domain. Therefore it is safest to estimate p by introducing a third refinement 
level. Assuming that the h/2 level is the finest refinement that the available 
computer memory allows, we have to use a coarser level, namely 2h. This 
gives 


Yan = Yo + e(2h)?. 
We obtain 


P2h — Ph = oP 


(13.37) 
Ph — Ph/2 


from which p may be determined. If the left-hand side of (13.37) is not posi- 
tive, h is too large to neglect the higher order terms in (13.36), or an asymp- 
totic expansion of the form (13.36) does not exist. 
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Because in realistic applications usually the existence of the asymptotic ex- 
pansion (13.36) has not been established theoretically, Richardson extrapo- 
lation is not a watertight procedure. But if a likely value of p, e.g. p = 2 
for a formally second order scheme is confirmed by (13.37), then Richardson 
extrapolation may be said to work. If there is no a priori guess of p, but 
[Ph — Yr/2| is significantly smaller that |~2_ — yal, and (18.37) gives approx- 
imately the same value of p in large parts of the domain (perhaps not in the 
vicinity of isolated points, which may indicate a local singularity), then again 
Richardson extrapolation may be said to work. If Richardson extrapolation 
works then po can be accepted as the grid-converged numerical solution. If 
|n/2 — Yn| is sufficiently small, then Richardson extrapolation is not neces- 
sary, and ,/2 can be accepted as the grid-converged numerical solution. 


For time-dependent problems it is best to construct numerical solutions that 
are independent of the spatial grid in every time step, and then apply Richard- 
son extrapolation in time. 


A good source on uncertainty in numerical computing is Celik, Chen, and 
Roache (1993). 


Concluding remarks 


We emphasize again that when one computes flows with the purpose to make 
engineering predictions, or to increase physical understanding, or to con- 
tribute to improvements in flow modeling, the issue of numerical accuracy 
should at the very least. be addressed, and preferably an estimate of the nu- 
merical error should be given, or grid-convergence of the numerical solution 
demonstrated. 


Even grid-converged numerical results obtained with a well-tested code can be 
wrong if there is something amiss with the underlying mathematical model. 
To avoid this, the user of CFD codes needs to be familiar with classical fluid 
dynamics. Often this enables one to see that numerical results are not cor- 
rect. For instance, symmetry conditions may be falsely imposed, resulting in 
a stationary symmetric solution, whereas it is known from fluid dynamics 
that in reality the flow is unsymmetric and oscillatory. Physical insight and 
knowledge of fluid dynamics are needed to choose the mathematical model 
and the boundary conditions correctly. Use of CFD codes without physical 
insight easily leads to results of unnecessarily poor quality or even to erro- 
neous results. 


A very good comprehensive discussion on verification and validation of com- 
puting methods is given in Roache (1998b). The topic is adressed by several 
authors in the fifth issue of Volume 36 (1998) of the AIAA Journal. 


14. Unified methods for computing 
incompressible and compressible flow 


14.1 The need for unified methods 


If there are regions in the flow domain where the Mach number M is not 
small, the incompressible Navier-Stokes or Euler equations cannot be ap- 
plied. In principle, the compressible equations of motion are uniformly valid 
as the Mach number ranges from zero to supersonic (until real gas effects set 
in). Therefore it suffices to forego the simplifications that incompressibility 
brings, and all one has to do is to employ the compressible equations of mo- 
tion. However, as will be discussed below, the standard numerical methods 
that have been developed for compressible flows (discussed in Chap. 10 and 
12) break down or do not function properly when M < 0.2. Methods to rem- 
edy this will be discussed in the present chapter. It would be ideal if a unified 
method could be found that is accurate and efficient for both compressible 
and incompressible flows. Such unified methods with accuracy and efficiency 
more or less uniform in the Mach number have indeed been proposed, and 
will be discussed below. 


Typical circumstances in which the Mach number is small but where com- 
pressibility cannot be neglected are the following. Due to the geometry of the 
flow domain, local high speed zones may be embedded in a low speed flow. 
For example, in an inlet of an internal combustion engine, the flow may be 
virtually incompressible (M < 0.1) in the bulk of the inlet channel, but we 
may have M = 0.5 near the valve. Another example of geometrically induced 
local high speed zones is the flow around the wing of a transport aircraft in 
landing or take-off configuration. For transport aircraft at minimum speed 
the freestream Mach number satisfies M = 0.2. If this is the case uniformly 
in the flow domain, incompressible flow models give sufficiently accurate re- 
sults. However, around aircraft in landing and take-off configurations, the 
local Mach number may be much larger than 0.2, and even regions of su- 
personic flow may occur around high-lift devices. This necessitates the use 
of the compressible Navier-Stokes equations for the prediction of maximum 
lift. At the opposite end of the velocity spectrum in aeronautical engineer- 
ing, during the hypersonic flight phase of a re-entering space vehicle, thick 
boundary layers and separation zones develop, in which M < 0.2, possibly 
resulting in inaccurate results and/or inefficient numerical processes for im- 
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portant quantities such as heat transfer at the body surface. Of course, low 
speed zones occur also in high speed flows near stagnation points, but in 
practice this does not seem to cause difficulties for standard compressible 
methods, presumably because these zones are small. Low speed flows may 
also be compressible because of density changes induced by strong heat ad- 
dition, for example in flows with combustion. 


First, we will explain why standard computing methods for compressible flow 
break down in the limit M | 0. Then we will discuss the concept of precondi- 
tioning, by which methods for compressible flow may be extended to the low 
Mach number regime. Finally, we will generalize a method for incompressible 
flows to a method with approximately Mach-uniform accuracy and efficiency, 
for Mach numbers ranging from zero to supersonic. 


14.2 Difficulties with the zero Mach number limit 


We start with the one-dimensional Euler equations (10.1): 


Pp m 
U: + FU): =0, U= m } f= m?/p+ P(p, e) . (14.1) 
pE mH 


The following considerations show that numerical difficulties are to be ex- 
pected when M | 0, with M = u/c, u = m/p, and c the speed of sound. 


Spectral number of Jacobian 


According to (10.9), the eigenvalues of the Jacobian f (U) are given by 
Auc, =u, Ag=ute. 
The spectral number of the Jacobian is defined by 
K(f (U)) = APAE, 


with p the spectral radius. For simplicity we assume that u > 0. Then we 
find 
i 1 1 1 M+! 


k(f)=({u+ce) max{ 7, uae! = max{1 + ww M-i . 


When «( f) > 1, f is ill-conditioned, and numerical difficulties may crop up. 
We see that f is singular in the sonic limit M — 1 and in the incompressible 


limit M { 0. As we saw in Chap. 10, in computing methods for compress- 
ible flows a carefully designed artificial viscosity term is incorporated, either 
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implicitly as in approximate Riemann solvers and flux splitting methods, or 
explicitly as in the Jameson-Schmidt-Turkel scheme. This is found to take 
care of the singularity at M = 1. However, no special measures are taken 
for the limit M | 0, and in practice these methods are inapplicable when 
large regions with M < 0.2 are present. We will not delve deeply into the 
underlying causes for this. For more information, see for example Koren and 
van Leer (1995), Guillard and Viozat (1999). But we will give a few more 
indications of numerical troubles to be expected with standard methods for 
compressible flow (such as those presented in Chap. 10) as M | 0. 


Stiffness of the equations 


In Sect. 10.6 we found the following heuristic stability condition for the ex- 
plicit Euler scheme: 


T<h/(u+e). (14.2) 


If acoustic effects are absent, in low-subsonic flow the physical time step 7, 
that is in balance with the physical time scale L/u is given by 


T = h/u. 
We find 
T/T < M/(1 +M), 


so that the numerical time step needs to be much smaller than the physical 
time step if M < 1. When this happens a system of differential equations is 
called stiff. If no special measures are taken, numerical inefficiency results, 
caused by the need to resolve acoustic modes, which travel with speed u c. 
However, if acoustic modes are not present in the flow, they need not be 
resolved, and the stringent stability condition (14.2) is a numerical artifact 
associated with standard computing methods for compressible flows. 


The accuracy problem 


Stiffness of the equations is not the only problem that besets standard com- 
pressible flow computing methods at low speed. There is also an accuracy 
problem. When the stiff system is solved by carrying out the required very 
large number of small time steps, often wrong results are obtained for low 
Mach numbers. This is shown and analyzed in Guillard and Viozat. (1999), 
Turkel, Fiterman, and van Leer (1994). 
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Choice of units 


Another indication of numerical trouble associated with the limit M | 0 shows 
itself when the Euler equations are made dimensionless. The influence of the 
choice of units is more conveniently shown for the nonconservative formula- 
tion (10.7) than for the conservative formulation (14.1). This nonconservative 
formulation can be rewritten as follows, assuming a perfect gas: 


pr+m, =l, 
me +(um+ p) =0, (14.3) 
pt + ypu, + up, =0. 


We choose units to make these equations dimensionless. As noted in Sect. 1.5, 
there are four basic units to be chosen, for which we take the unit of length 
z,, of velocity up, of mass p,22. The unit of time is deduced from this as 
tp = z,/u,. The unit of pressure follows as p, = p,u2; but we may choose 
pr differently, the consequence of which may be the introduction of some co- 
efficient in the governing equations. For the moment the choice of p, is left 
open. We let z, be a characteristic dimension of the domain, and u, a typi- 
cal velocity magnitude, such as the velocity at infinity for the flow around a 
body. Our choice of the time unit reflects the fact that we are not interested 
in acoustics, for which the physical time scale is much smaller than t,. 


Dimensionless quantities are defined by p = Pl Pr; p = p/pr etc. The di- 
mensionless form of (14.3) is found to be, upon deleting primes for brevity: 


Pet Mr =0, 

m, + (um); + pr = 0, 
pru 

Pi + YPur + up, =Q. 


In compressible fluid dynamics it is customary and reasonable to choose for 
Ppr a value that is representative for the magnitude of the pressure in the 
flow, for example the outlet or free stream pressure. Furthermore, naturally, 
pr is chosen such that it is representative for the magnitude of the density of 
the flow. For example, pp may be chosen equal to the density at the inflow 
boundary. As a consequence, c, = \/Yp,/p, is an estimate for the magnitude 
of the sound speed in the flow. The dimensionless momentum equation can 
be written as 


1 
mi + (um) + yM?” =0, M, =u,/cr, 


and the reference Mach number M, is representative for the magnitude of 
the Mach number that occurs in the flow. We see that this equation becomes 
singular as M, | 0, which is again an indication of the difficulties to be 
expected for low subsonic flow with methods developed for compressible flow 
only. 
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14.3 Preconditioning 
The principle 


Standard methods for computing compressible flows, such as those described 
in Chap. 10, are numerically inefficient for weakly compressible flows, because 
of the stiffness of the governing equations, discussed in Sect. 14.2. One way 
to alleviate this problem is to change the governing equations. This looks 
like putting an end to the disease by killing the patient, but is not as bad as 
it seems. The procedure is called preconditioning, and is not to be confused 
with preconditioning in the field of numerical linear algebra. Precondition- 
ing not only removes the stiffness problem, but solves the accuracy problem 
as well, as shown in Guillard and Viozat (1999), Turkel, Fiterman, and van 
Leer (1994). A practical advantage of the preconditioning approach is that 
existing codes for compressible flows can be easily modified to improve their 
performance for weakly compressible flows. 


The principles of preconditioning will be explained for the one-dimensional 
inviscid case. The flow is governed by the Euler equations (14.1). Precondi- 
tioning consists of modifying the equations artificially by multiplication of 
the time-derivative by a matrix P7!: 


PU, + F(U): =0. (14.4) 


Stationary solutions are not affected by preconditioning, but time accuracy 
is lost. The preconditioning matrix P(U) is to be chosen such that (14.4) is 
less stiff than the original system as M | 0. The eigenvalues of the system 
are now given by A(P f), and should remain closer together as M | 0 than 


Af). 


An example of a preconditioner 


The design of P(U) has been and still is the subject of much research. Pointers 
to the literature will be given later. A generally accepted favorite precondi- 
tioner does not seem to have emerged yet. 


We will discuss the preconditioner proposed in Weiss and Smith (1995). We 
switch to new primitive variables Q: 


with T the temperature. The preconditioned system is 


PQ: + F(Q): =0 (14.5) 
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with 

0 0 pr 

r= | 6u pP upT ; 

0H —1 pu Hpr + pep 
where ĝ is a parameter to be chosen. H is the total enthalpy given by 
H = h+ zu’, h = ¢pT, cp is the specific heat at constant pressure, 
and pr = (0p/0T)>, i.e. the temperature derivative of p with p kept con- 
stant. 


Because I is variable, (14.5) is not in conservation form, but the stationary 
part is in conservation form. 


The eigenvalues | 


In order to see whether (14.5) is less stiff than (14.3) we have to inspect the 
eigenvalues \(F~!f (Q)). For the parameter 9 Weiss and Smith make the 
following choice: 


0 = 1/w* — pr/(pep) , (14.6) 


where w is a velocity magnitude that will be chosen later. In Example 14.3.1 
we find: 


AITES (Q)) = {u, Ete}, (14.7) 
i=u(l-a), €= Vau? +w, a= (1-— ßw?)/2, 
p= (Pp + Pr ) ' 

PCp 


where pp is the pressure derivative with T constant. For w the following choice 
is made: 


ec if lul<ec, 
w= ¢ jul if ec<ful<e, 
c if [ul >c, 
with c the speed of sound. The small parameter e (~ 1075) is included to 


prevent singularities in points where u = Q. In the viscous case (not considered 
here), w is replaced by w, given by 


w = max(w, v/h) 


where A is the intercell length scale over which diffusion occurs; v is the 
kinematic viscosity coefficient. 


We now inspect the behavior of the eigenvalues as M varies. For a perfect gas 
we have 
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B=, (14.8) 


so that a = (1 — M2)/2, Mwy = w/e. Hence, when M = [u|/c > 1 we 
have a = 0, and the eigenvalues are equal to those of the unpreconditioned 
system: 


APF (Q)) = {u, utc}. 
At low speed (|u| < c) we have a = 1/2, and w+é = fu(14 V5), so that the 
magnitude of the three eigenvalues does not differ much. Hence, the stiffness 
has been removed from the system. 


Example 14.3.1. Derivation of eigenvalues 


In this example we derive (14.7). The eigenvalues (P71 (Q)) of the pre- 
conditioned system (14.5) satisfy: 


det(f (Q) -AL) =0. 
We find 
ppu — A0 P PTU — Apr 
f -AP|=| ppu? +1- Aud 2pu — Àp pru? — Apru 
ppuH — AHO +A Spu? + ph —Apu (prH + pep)(u— 2d) 


By subtraction of the first row multiplied by u from the second, and subtrac- 
tion of the first row multiplied by H from the third we get: 


lf -AF|=p 1 u—À 0 =0. 
À u(u—A) pcp(u—A) 
This gives 


(u—A){pep — u’ (pr + pppcp) + Au(2Pr + pppcp + pcp) 
—A*(pr + Opep} = 0. 


Hence, we find one eigenvalue A = u. We continue with the second factor. 
Substitution for 0 from (14.6) gives 


A? — u(l + bw?)à — w? (1 — Bu?)=0. 


The roots are given by 


A=Uute, t=u(l—-a), č= Vau +w. 
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A numerical scheme 


We will now discretize (14.5) in space. For discretization in time the methods 
discussed in Chap. 10 can be followed. Applying finite volume integration (as 
in Chap. 10), we get 


dQ; 
dt 
where h; is the size of the cell. It remains to specify the cell face flux function 


F;41/2- Following Weiss and Smith (1995) we use a Roe type scheme for this. 
In Sect. 10.3 we saw that the Roe scheme gives 


hy Ty? + FQ) TYG =O, (14.9) 


1 1 
Fy 4/2 = 91 F(Q5) + f(Q5+41)} - zl4j+1/2|(Qj+1 — Q;) , 


where A = ðf /ƏQ. This means that the spatial discretization is unaltered 
by the preconditioning. Practical experience and the analysis of Guillard and 
Viozat (1999), Turkel, Fiterman, and van Leer (1994) show that this gives 
unsatisfactory results at low Mach numbers. Much better results are obtained 
if the preconditioning is allowed to influence the artificial diffusion term. This 
can be done by choosing 


1 ] 
Fj+1/2 = 9 (f(Q;) + f(Qj41)} - z Ti+ Alji —Q;). 
(14.10) 


This does not change the consistency of the scheme, since we still have a 
central scheme with artificial diffusion. But it does change the numerical 
solution. 


The next step is the evaluation of the artificial diffusion term. This can be 
done in the same way as for the Roe scheme. Let R be the matrix with 
the eigenvectors R1, R2, Ra of [1A as columns, and let Rt, R?, R? be row 
vectors defined by 


RIR, = 63 . 
Define 
ap = R? (Qj+1 — Q;) . 
Then 
3 
[P~*Alj41/2(Qj41 — Q3) = D> plop Rp ; 
p=l 


where, as we saw before, Ay = u, Az = Ŭŭ — č, Àa = ù + č. In Example 14.3.2 
we show that 
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0 au+č au — Ë 
1 (au +č)(H + 5) (au -EH + =) 


and 
R! -(H + cen) 0 1 
R | =| L(1—-pufau-é)) (au-é 0]. 
+(—1+ pu(ati+é)) —Z(au+é) 0 


The matrices [7341/2 and Aj4,1/2 are not evaluated at the Roe average of 
Qj41 and Qj, because the nice properties of the Roe average are now lost. 
Instead, evaluation takes place at (Q; + Qj41)/2. 


For extension of this method to the two-dimensional viscous case, see Weiss 
and Smith (1995), where successful applications to fully and weakly com- 
pressible flows are shown. 


Example 14.3.2. Etgenvectors for artificial diffusion term 


The eigenvectors of [—1A satisfy 


(A-P)R= 
Ppu — A0 p PTU — Apr rı 
ppu? +1 — Aud 2pu — Àp pru? —Apru r2 
ppulf — AHO +À Spu? + ph — Apu (prH + pcp)(u— A) r3 


=0. 


For A = A, = u we see immediately that a solution is 


By subtraction of the first row multiplied by u from the second and subtrac- 
tion of the first row multiplied by H from the third we get 


Ppu — 0 p prlu — à) rı 
1 plu — ) 0 r2 — ur | =0. 
À pu(u—A) pcp{u— à) r3 — Ary 


Hence 


ry = uri + —, r3 = Hr +. 
p(A — u) PCy 


For À = Àz = u—€ this gives 
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ri ri 
rg = ury — ————, r3 = Hri + —. 
2 1 plau + č)’ 3 i pep 


We choose rı = au + č, and obtain: 


pe 
Similarly, for À = A3 = a + č we obtain the following eigenvector: 
au—€é 
R; = u(ļau — è) — + 
(H++)\(au-8) 


The associated bi-orthonormal system of row vectors RI satisfying 
R3 - Rp = ô} is given by 


R? = R; x R,/Ri:(R2x Rs), pig,r cyclic. 


We find 
R! -H — = 0 1 
R | = zg(1 — pulau — é)) £(au—¢) 0 
R? zz(—1 + pu(au+é)) —£&(au +2) 0 


Unsteady flows 


To compute unsteady flows with a preconditioned scheme, time accuracy has 
to be restored. One way to do this is to use dual time stepping. A pseudo-time 
s is introduced. The physical time is denoted by t. Equation (14.9) is replaced 
by 


dU; 


dU; dQj 
7 dt 


h 
ds 


;+1/2 
+ jl; + F(Q) = 0. 
Time discretization methods in t and s are chosen. For simplicity, we choose 
the implicit Euler scheme in time and the explicit Euler scheme in pseudo- 


time, and obtain: 


h; (OU mos m n h;I; m m- m=1;jj 
= (3a), (ar — ap) + BE (gn — grt) + FQ" = 0, 


where r is the time step and ø is the pseudo-time step. The superscript n 
counts physical time steps and the index m counts pseudo-time steps. For 
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m = 0 we put Q™ = Q”. The next time step is executed by stepping forward 
with m = 1,2,...,m, where m follows from the condition that steady state 
has been reached in pseudo-time: 


Aj ly a > 
= (QF - QP" < tol <1. 


When this holds the pseudo-time difference can be neglected, so that we have 
completed a time step for the physical system 
dU; j+1/2 
it + F(Q) Fai = 0 a 
We put Q"+! = Q”, and the process can be repeated for the next physical 
time step. 


Because the preconditioning infiuences the artificial diffusion term in F(Q), 
different, and as it turns out, more accurate results are obtained for weakly 
compressible flows than with the original unpreconditioned equations. 


In Weiss and Smith (1995) it is shown that dual time stepping is significantly 
more efficient than physical time stepping with the original system for low 
Mach numbers. Nevertheless, dual time stepping is expensive, because a large 
number of pseudo-time steps (30 in an example given in Weiss and Smith 
(1995)) is required for each physical time step. Hence, the efficiency lags 
behind that of incompressible fiow solvers, so that dual time stepping methods 
cannot be called efficient. 


In Guillard and Viozat (1999) it is shown that that pseudo-time stepping can 
be dispensed with. We can solve directly 


with Fj41/2 given by (14.10). As remarked in Paillére et al. (1998), for explicit 
schemes stability requires a very small time step, so that an implicit time 
stepping scheme must be used. If this is necessary anyway, because of a large 
disparity in mesh sizes for example, this is no disadvantage. However, it turns 
out that the resulting nonlinear algebraic system is difficult, and iterative 
methods that have been used until now require much computing time. This 
still needs improvement. 


Final remarks 


For extensions to more dimensions, the viscous case, flows with combustion 
and chemistry and implementation of boundary conditions, the reader is re- 
ferred to the literature. For a review of early literature, see Turkel (1993). The 
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preconditioner of Weiss and Smith (1995) described above is an extension of 
the one introduced in Choi and Merkle (1985), Choi and Merkle (1993). Iter- 
ative methods for implicit preconditioned schemes are analyzed in Koren and 
van Leer (1995), Koren (1996), Darmofal (1998). Some recent publications 
are: Turkel, Radespiel, and Kroll (1997), Edwards and Roy (1998), Guillard 
and Viozat (1999), Mary, Sagaut, and Deville (2000). 


Preconditioning is a way to extend the functionality of existing codes for fully 
compressible flows to almost incompressible flows. The efficiency of methods 
for incompressible flows is not matched. In the next sections the reverse route 
will be followed, namely extension of methods for incompressible flows to 
compressible flows. We will find that this results in methods that can match 
the efficiency of compressible flow methods for fully compressible flows, with 
superior efficiency for weakly compressible flows. 


Exercise 14.3.1. Derive (14.8). 


14.4 Mach-uniform dimensionless Euler equations 


As we saw in Sect. 14.2, the Euler equations become singular as the Mach 
number tends to zero, if the equations are made dimensionless in the way that 
is customary for fully compressible flows. It is obvious that this difficulty also 
occurs for the Navier-Stokes equations. We will see presently that when the 
equations are not made dimensionless, the difficulty manifests itself in the fact 
that as M | 0, the pressure fluctations that drive the flow become vanishingly 
small compared to the absolute pressure. To avoid numerical troubles, this 
has to be remedied. A different way to make the equations dimensionless has 
to be found. The unknowns have to remain bounded both for M | 0 and 
M >> 1. To prepare ourselves, we first make an asymptotic analysis of the 
Euler equations as M | 0. 


Asymptotic expansion 


We consider the three-dimensional case. The nonconservative equations of 
motion (12.3)-(12.5) without heat addition and body force are made dimen- 
sionless in the same way as in Sect. 14.2, repeated here. The three space 
coordinates are made dimensionless by the same reference length z, which 
is representative of the length scales occurring in the solution. The unit of 
velocity u, is representative of the velocity magnitudes that occur, and T, 
and p, are of the order of the magnitudes of the temperature and the density 
of the flow. The reference pressure p, is deduced from the equation of state. 
We obtain the following dimensionless equations: 
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2P + pdivu =0, (14.11) 
Du 1 2 

Du il = = ppu? p, , 14.1 
Pp, t Btadp=0, €= prur/p (14.12) 
a + ypdivu =0. (14.13) 


For a perfect gas we have p,/p, = c?/y, with c, the speed of sound belonging 
to the reference conditions, so that 


e=yM?, M,=u,/e,. 


In order to study the incompressible limit of (14.11)-(14.13), we postulate 
an asymptotic expansion of the following form: 


u(t, x) = uo(t,x) +eu,(t, æ) + O(e?), 
plt,#) = polt, æ) +epilt, 2+ O(e?), (14.14) 
plt,æ) = polt, æ) +epi(t,#) + O(e*). 


This implies that to, u1, po etc. and their t- and x°%-derivatives are inde- 
pendent of £. Since acoustic modes have ¢-dependent time and length scales, 
these are excluded by the Ansatz (14.14). For asymptotic analysis including 
acoustics, see Klein (1995). If the terms tto, t1, po etc. can be determined 
in a consistent way, the asymptotic expansion (14.14) may be safely assumed 
to be correct. 


By substituting (14.14) in (14.11)~(14.13) and equating terms with like pow- 
ers of £ we get the following results. To leading order, equation (14.12) gives 
grad po = 0, hence 


Po = Po(t) . 


Because po does not depend on æ, it has no dynamic effect. If there 
is no global compression or expansion, as for the flow around the body, 
po (t) = constant, and po represents the constant background pressure level. 
If there is net mass inflow or outflow in the flow domain, po(t) represents the 
effect of global compression or expansion. This can be seen as follows. Equa- 
tion (14.13) gives to leading order: 





divuo = —— . (14.15) 
Y 


Integration over the flow domain 22 gives 


dinpo_ 7 
oP = 3 J wos nas (14.16) 
an 
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so that po follows from the normal velocity component at the boundary. This 
makes it clear that po(t) is due to global compression or expansion. 


If the normal velocity is not prescribed along the whole boundary, po(t) must 
be given, but if it is, po(t) follows easily from (14.16). Hence, po(é) may be 
assumed known. Equation (14.11) gives to leading order the equation that 
governs po: 





1 dln po 


ð 
(— + uo: V)ln po = 5 di (14.17) 


Ot 


We still need an equation to determine wo, since (14.15) is not sufficient. The 
equation for uo is obtained from the next order terms of (14.12): 

D 

= + gradp; =ņ. (14.18) 
Equations (14.15)-(14.18) determine to, po and pı; pı acts as a Lagrange 
multiplier to allow uo to satisfy the constraint (14.15). When po and po are 
constant we recover with (14.15) and (14.18) the familiar incompressible Euler 
equations. We may conclude that for the determination of tg, po, po, pı we 
have obtained a well-posed problem, which indicates that the asymptotic 
series (14.14) is correct. 


Definition of dimensionless pressure 


We have just seen that 
p(t, æ) = po(t) + epi(t, æ) + Ole) . 


We see that for € < 1 there is a danger of rounding errors in numerical 
approximations of gradp. This can be avoided by working with p— po instead 
of p. This is possible because, as argued above, po(t) can be determined a 
priori. From (14.18) we see that pı is of the same size as potto - ug. We 
therefore choose the dimensionless pressure as follows: 


P — Po 
P= 3. 14.19 
pruz ” l ) 
where for clarity the hat symbol is used to denote dimensional quantities. 
The other quantities are made dimensionless as in Sect. 10.2. 


Assuming a perfect gas, we have the following dimensional equation of state: 
p= RpT . 


The dimensionless form of the equation of state becomes, noting that 
a3 F 
ĉ = yRT, 
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>= RT — Po _ LT- P, M, = ür fê, , (14.20) 
Pr Uy yM; Prt, 

which is singular as M, | 0, no matter how we choose po. This is a sign that 

in order to handle weakly compressible flows, we should use the pressure as a 

primitive variable and find the density from the equation of state, and not the 

other way around. Therefore the dimensionless equation of state is rewritten 


as p = p(p,T): 





3 
p= (E47 Po Je (14.21) 
p 


We now restrict ourselves to the case where there is no global compression 
or expansion, so that jo(t) = constant. This means that fiows into or out of 
vessels are excluded, and only flows around bodies or through channels are 
considered. We relate jg to fp and T; by the equation of state: 


a 


Po = Rp,T, . (14.22) 
Substitution in (14.21) gives the following dimensionless equation of state: 
p= (1+ yM?2p)/T. (14.23) 


This is precisely what we want. When M, — 0 the dependence of p on p 
disappears, eliminating acoustics, but p still varies with temperature. Sub- 
stitution of (14.22) in (14.19) gives our final definition for the dimensionless 
pressure: 
p 1 
p= | -—. 14.24) 
pr? yM; l 





The dimensionless form of the nonconservative homogeneous equations of 
motion (12.3)—(12.5) is found to be 


pe + divm =0, 





(u°m) + gradp = 0, 


M + ô 

tT pre 
M2{m + div(up) + (y — 1)pdivu} + divu = 0 . (14.25) 

We see that as M, | 0 the system does not become singular, and that the 


pressure equation reduces to the familiar solenoidality condition of incom- 
pressible flow. 


Dimensionless conservative formulation 


The dimensional form of the conservative energy equation is rewritten as 
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j 1 
(copT + spu*)t + (pucpT + zpw’) =0. 


The dimensionless form becomes 

ù? 1 a2 1 
T r. y? T T pu?) =0. 
(» + 2" ) + (om + T, 3r" ). 


Cyt, Cydtr 








Noting that ¢,T = ¢?/y(y — 1) we can rewrite this as 


1 1 
(or +7- 1) M25 pu?) + (sour +77 - MSp) =0. 
t 


T 


This can be put in a more familiar form if we define the dimensionless en- 
thalpy and internal energy as 


h=7T, e=T, 


and if, furthermore, we define the dimensionless total enthalpy and total 
energy as 


where the tilde serves to remind us that these definitions are not standard. 
Then the conservative energy equation takes on its familiar form: 


(pE), + (mÅ) =0, m= pu. (14.26) 


As M, | 0, solving p from the equation of state (14.23) is not possible. 
Therefore the conservative energy equation (14.26) should not be used for 
weakly compressible flow, but the nonconservative version (14.25). 


Boundary conditions 


For the units employed to make the equations dimensionless, appropriate 
values may be deduced from the boundary and/or the initial conditions. 
Boundary conditions for the Euler equations have been discussed in Sect. 10.2, 
12.2 and 12.4. We assume that velocity and temperature are given at the 
inflow boundary, and the pressure at the outflow boundary if the normal 
outflow velocity is subsonic, and at the inflow boundary if the normal inflow 
velocity is supersonic. If the inflow velocity is zero, as in a shock tube problem, 
a suitable velocity unit &, is deduced from the initial conditions. For instance, 
in a shock tube problem one may take 


û, = 2V |pr — pL |/(PR + PL) , 
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where R and L denote the initial right and left states. In this way we obtain 
t,, po and T,. Equation (14.22) gives 6,. The reference speed of sound and 
Mach number are given by 


é.=\V/yRT,, M, = t,/é,. 


The boundary conditions may be time-dependent, and can be summarized as 
follows, taking the one-dimensional case with subsonic outflow as an example: 





Boundary conditions at solid walls and boundary conditions for the Navier- 
Stokes equations are the same as those discussed in Sect. 12.4 and 13.2. 


14.5 A staggered scheme for fully compressible flow 


In the next section the classical staggered scheme of Harlow and Welch (1965) 
for incompressible flow will be extended to compressible flows. But first, in 
the present section, we will develop a staggered scheme for the conservative 
fully compressible Euler equations. The dimensionless form of these equations 
was found in the preceding section to be given by 


prt my, =0, 
mi + (ump), =0, 
(PE): + (mH): =0. 


The primary unknowns are p, ™, E. As noted in the preceding section, this 
leads to trouble for weakly compressible flow, but the case of weak compress- 
ibility will be the topic of the next section. Here we want to investigate if a 
staggered scheme can be used for fully compressible flow. 


Finite volume discretization 


Fig. 14.1 shows a staggered grid with uniform cell-size h. The cell centers 
are located at z; = (j —1/2)h, j = 1,2,...,J. The cell faces are located 
at 2541/2. The momentum unknowns are located at the cell faces, and the 
thermodynamic unknowns are located at the cell centers, quite similar to the 
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4 1 j jt+i J 


Fig. 14.1. A one-dimensional staggered grid. 


staggered scheme of Sect. 6.4. 


The dimensionless governing equations are discretized in space with the finite 
volume method. Integration of the mass conservation equation over a cell with 
center at x; gives, with explicit Euler time stepping: 


1/2 
prt? — ph + rerun) =0, A= Th. 


The first order upwind scheme gives, if uj;41/2 > 0, 


Pi 41/2 =p}, 


whereas the second order slope limited scheme of Sect. 4.8 gives 


Pi44/2 = P; + ae DITAS ~ pi) ' 
Similarly, the momentum equation is discretized as 


m to, 


T12 = mji HAM” + p") |; 
where (um)? can be approximated with the first order upwind scheme or the 


slope limited scheme. Finally, the energy equation is discretized as follows: 


(PĒ)! — (PĒ)? + (up Ã t= 0 


where (pH)? a2 can be approximated with the first order upwind scheme 
or the slope limited scheme. After (p£)"*1 has been obtained, p"+? fol- 
lows from p = (pe — 1)/yM;, (pe); = (PE); — vy — IMP H{(um)j 41/2 + 
(um)j-1/2} uj+1/2 = 2mMj41/2/ (P3 + Pj+1). 


Boundary conditions 


We assume that u > 0, and that inflow and outflow conditions are subsonic. 
The following boundary conditions are assumed: 


u(t,0) = u(t), plt,0) = plt), p(t,1) = pelt). 


We put pJ41/2 = pJ, corresponding to the use of the first order upwind 
scheme at the outflow boundary. For m 41/2, finite volume integration takes 
place over a half cell: 
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n n n n J n 
MIEI — My gaya +AU M”) + 2A plta) - phy) = 0. 
We write 


(PË) = po(t){hi +y- 1)M2 us (t)?} , 


and (pH) 5441/2 = (pH), corresponding to the first order upwind scheme. 
The accuracy of numerical solutions for Riemann problems benefits if the 
initial discontinuous solutions are applied consistently. For example, if the 
discontinuity is at a velocity node z;41/2, we put 


uj+1/2 = (uL +ur)/2, Mj+1/2 = Uj+1/2(PL + pR)/2 - 


and for all j 


(PĒ); = M2 (1+ py) + (7 — IMP (um) 44j2 + (um)j-172} 


Numerical results 


Results for the Riemann problems of Chap. 10 are shown in Figs. 14.2-14.6. 
The slope limited scheme is used with the van Albada limiter of Sect. 10.8. 
The same time stepping scheme is used as in Sect. 10.8 for colocated schemes, 
namely the SHK Runge-Kutta scheme. The estimates for the stability limit 
A; for À are given in Table 10.4. In practice we found that sometimes » has 
to be somewhat smaller than A; for good results. Unless noted otherwise, 
h = 1/48. Comparison with the results of Sect. 10.8 shows that the stag- 
gered scheme has the same accuracy as the colocated schemes of Sect. 10.8. 
In the case of the supersonic shocktube there is a velocity overshoot at the 
beginning of the expansion fan. This is related to the smearing of the den- 
sity, which is due to insufficient resolution. There are only a few cells in the 
narrow zones between the fan, the contact discontinuity and the shock. This 
hampers a staggered scheme more than a colocated scheme, because due to 
the staggered placement of the unknowns the stencil is effectively a bit wider. 


The accuracy improves significantly if the mesh size is halved, as shown by 
Fig. 14.6. Fig. 14.7 shows results for a moving contact discontinuity. There 
are spurious wiggles of size less than 0.2% in velocity and pressure. These 
insignificant wiggles arise from imperfect representation of the initial dis- 
continuity on the staggered grid, and spurious reflection from the outflow 
boundary. 


The staggered scheme requires significantly less computing time than the 
colocated schemes of Sect. 10.8, because the numerical fluxes are much sim- 
pler. We conclude that the staggered scheme converges to genuine weak solu- 
tions that satisfy the entropy condition, and is quite suitable for compressible 
flows. 
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Staggered scheme 


Conservation form 
SHK Runge-Kutta 
van Albada limiter 
t=0.15 

lambda = 0.45 J = 49 





0.5 


Fig. 14.2. Sod’s test problem with staggered scheme. 
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Fig. 14.3. Test case of Lax with staggered scheme. 
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Staggered scheme 


Conservation form 
SHK Runge-Kutta 
van Albada limiter 

t = 0.0875 

lambda = 0.15 J = 49 
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Fig. 14.4. Mach 3 test case with staggered scheme. 
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Fig. 14.5. Supersonic shocktube problem with staggered scheme. 
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ENTROPY Staggered scheme 
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SHK Runge-Kutta 
1 van Albada limiter 
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-0.5 
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Fig. 14.6. Supersonic shocktube problem with h = 1/96. 
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Fig. 14.7. A moving concact discontinuity. 
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14.6 Unified schemes for incompressible and 
compressible flow 


The scheme of the preceding section breaks down when the flow is almost 
incompressible, because the pressure is not used as one of the primary un- 
knowns. In this section a well-established discretization scheme for incom- 
pressible flows will be extended to compressible flows. Obviously, this may 
be expected to result in an efficient and accurate method for weakly com- 
pressible flows. It will come as a pleasant surprise that good performance is 
also obtained for fully compressible flows. 


Point of departure is the classical staggered scheme for incompressible flow 
of Harlow and Welch (1965), discussed in Sect. 6.4. Generalization of this 
scheme to the compressible case has already been undertaken in Harlow and 
Amsden (1968), Harlow and Amsden (1971). There are mainly four differ- 
ences between our approach and that of Harlow and Amsden. In the first 
place, we use the dimensionless pressure as defined by (14.19). This removes 
difficulties with rounding errors at very low Mach numbers. In the second 
place, we use a nonconservative form of the energy equation, for better ef- 
ficiency. It turns out that this does not impair the approximation accuracy 
of genuine weak solutions. In the third place, we let the discretization of 
the mass conservation equation depend on the Mach number; this gives bet- 
ter accuracy for supersonic flow. Finally, we use the MUSCL approach and 
Runge-Kutta time stepping for second order accuracy. 


Governing equations 


The dimensionless equations are used. In order to handle weakly compress- 
ible and incompressible flows, we use the pressure instead of the density as 
primary unknown. The mass conservation equation is written as follows: 


PoPt + pri: +mz =0. (14.27) 
The equation of state (14.23) gives 
2 
-Mr p 
Pp == T 4 PT — T ‘ 
The momentum equation is given by 
m+ (um+p),=0. (14.28) 


For the energy equation we start from the nonconservative dimensional ver- 
sion (14.13). By using the definition (14.24) of the dimensionless pressure and 
the equation of state (14.23) the energy equation can be cast in the following 
dimensionless form: 
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T+ (uT)s + (y—2)Tu, = 0. (14.29) 


We will also use a different set of equations. Instead of the temperature 
equation (14.29) the non-conservative pressure equation (14.13) is used. In 
that case, the dimensionless governing equations are: 


prtm, =0, (14.30) 
m+ (um+p), =0, (14.31) 
M? {pi + (up)e + (y — l)puc} + uz = 0. (14.32) 


Finite volume scheme 


The staggered grid of Fig. 14.1 is used. We propose the following finite volume 
scheme for (14.27)-(14.29): 


yM2(prtt — p) — oP (ret! — TP) + 


ATP (s" pu" + (1 = s”) t t= 0, (14.33) 
mitia ~ Mfg + Alu” m” +p gH =0, (14.34) 
TPH TP + Au” TÉ Ia +A- Trur itia 0, (14.35) 


where A = r/h, p?+1/2 = (p° + p°+!)/2, and s” = s(M”) is a Mach- 
dependent switch function, defined as 

s(M) = 0, M<1/2, 

s(M) = M-1/2, 1/2< |M|< 3/2, 

s(M) = 1, M>3/2. 
Here Mj4is2 = 2|uj4isal/(c; + €j41) with c the speed of sound. It is found 


that the results are insensitive to the definition of s(M), as long as s(M) 
switches from 0 to 1 in some neighbourhood of M = 1. 


We see that in (14.33)—(14.35) some terms are evaluated at the new time level 
tti, namely m in (14.33) and p in (14.34). This is done in order to recover 
the classical staggered scheme in the incompressible limit. 


In (14.33), P;41/2 18 evaluated with a slope limited scheme, as described in 
the preceding section. The same method is used to approximate (u” m”); and 


Tiry 


Pressure-correction method 


In the one-dimensional case the nonlinear system that we have obtained for 
p”ti, T+! and m"*? is easily solved. But in more dimensions iterative meth- 
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ods are required with efficiency uniform or almost uniform in the Mach num- 
ber M,, especially as M, | 0. We therefore generalize an iterative method 
that is efficient in the incompressible case to the compressible case. For non- 
stationary flows we take our point of departure in the pressure-correction 
method, described in Sect. 6.6. For stationary flows it is more efficient to 
generalize one of the iterative methods for incompressible flows described in 
Sect.7.6. But here we restrict ourselves to the nonstationary case. 


First, T”+! is obtained from (14.35). Making this possible is the reason for 
using a non-conservative form for the energy equation. Next, a momentum 
prediction m* is obtained from (14.34), replacing p?+1/? by p”: 


1 


A momentum correction 6m = m"+! — m* is postulated as 


1 . 
dmj41/2 = 5p , Sp=prth—p". 


Substitution of m”+t? = m* + ôm in (14.33) gives the following pressure- 
correction equation for dp: 


1 n n i n i 
M7 6p; -— zA T; {(1- $4 )2)6p -(i- 87_1/2)6Plj-1} = 


PPT! — ATP (s" pr ur) ty (14.36) 


Boundary conditions 


The boundary conditions are implemented as in the preceding section. For 
the pressure-correction equation (14.36) no additional boundary conditions 
are required, just as in the incompressible case discussed in Chapt. 6. We 
have, for j = 1, 


1 
5 plo = —Admry2 = Alpou) lert - 
For j = J the momentum equation is integrated over a half cell: 


* J+1/2 
M4172 7 M4172 + 2A(u™m” + p”)|5 ? = Piy = pe{t”) , 
n+l 


ÔMJ4+1/2 = —A(pol? — dps), 
which replaces in (14.36) for j = J the term Adpii*?. 


Runge-Kutta method 
Instead of the Euler method we can use other time stepping schemes, for 


better temporal accuracy, or for better efficiency, due to weaker stability re- 
strictions on À. For example, we may employ the SHK Runge-Kutta scheme 
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that was used before. Each stage is an Euler time step, so that its imple- 
mentation is obvious. In more dimensions, the most expensive part of the 
algorithm is the solution of the implicit system (14.36). For efficiency, one 
may freeze the implicit part at the old time level in the first three stages, and 
take it at the new time level only in the fourth stage, which is an Euler step 


with the full time step 7. This means that the (m+ 1)th stage becomes: 


Ti") — TP + amp du TO) EA + amp A — 2)TI™ uP PS = 0, 


j-1/ 
mrt = M412 t ampi Aflu” m™ + p”) it! =0. 


In the fourth stage the pressure correction is carried out: 


yM25p; — ATI -s s opi? — 1- s .)6plt_,} = 
(4) 


p(T) = TP) = ATSP (89 u) Ea 


Numerical experiments 


The above scheme has been designed to reduce to the well-established in- 
compressible staggered scheme with pressure-correction, as the Mach number 
tends to zero. It remains to investigate its performance in the fully compress- 
ible case. We do this by carrying out numerical experiments for the same Rie- 
mann problems as in the preceding section. The SHK Runge-Kutta scheme is 
used, as in Sections 14.5 and 10.8, Pressure-correction is applied only in the 
final stage, as just described. The scheme is more stable than the conserva- 
tive scheme of Sect. 14.5 (due to the implicitness of the momentum-pressure 
coupling), but in order to have a fair comparison of accuracy the same values 
of A will be used as in Sect. 14.5. Unless noted otherwise, h = 1/48. 


Numerical results for the test problems of Sect. 14.5 are shown in Figs. 14.8- 
14.13. Comparison with the results obtained with the conservative scheme 
in Sect. 14.5 shows that, except for the contact discontinuity, the accuracy 
obtained is about the same. With the pressure-correction method shock reso- 
lution is a bit less crisp, but shock speed and strength are correct. For the test 
case of Lax the accuracy is somewhat less than for the conservative scheme. 
Grid refinement gives the results shown in Fig. 14.13, which show a better 
approximation of the expansion fan, but still contain a density overshoot be- 
tween the shock and the contact discontinuity. These are close together and 
both strong, which is a numerically difficult situation. For the contact dis- 
continuity spurious wiggles of magnitude 4% are generated, which is rather 
more than for the conservative scheme. 
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Fig. 14.8. Sod’s test problem with pressure-correction scheme. 
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Fig. 14.9. Test case of Lax with pressure-correction scheme. 
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Fig. 14.10. Mach 3 test problem with pressure-correction scheme. 
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Fig. 14.11. Supersonic shocktube problem with pressure-correction 
scheme. 
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Fig. 14.12. A moving contact discontinuity with pressure-correction 
scheme. 
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Fig. 14.13. Test case of Lax with k = 1/96. 
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Second pressure-correction method 


Because of the inaccuracy of the preceding scheme for moving contact discon- 
tinuities we will derive a second pressure-correction scheme, based on the set 
of governing equations (14.30)—(14.32). Because p = constant, u = constant 
satisfies the pressure equation (14.32) exactly, we hope to obtain better accu- 
racy for moving contact discontinuities. Equation (14.30) is used to update 
the density with a slope limited scheme, equation (14.31) is used to compute 
a prediction for the momentum using the pressure at the old time level, and 
equation (14.32) is used to obtain an equation for the pressure correction. 
The predictions for the density and the momentum are made with the SHK 


Runge-Kutta method. The (m+ 1)th Runge-Kutta stage is given by 


m n m m ] 2 
e +D om + dul ) o| yer =0, 


, | 
mpra — Myry +AU m + pjt = 0, 


where pn) j2 and (ul) m')); are approximated with a slope limited scheme. 
There is no Mach-dependent switch function, as in the first pressure-correction 


scheme. We put 


n 4 n 1 } n n 
pt! = pf ) , moti = miia — zapli , p= p°t!— p°, (14.37) 


The pressure equation (14.32) is discretized as follows: 


1/2 j+1/2 i+1/2 
M?{őp; + Alur +per? + Xy- 1)pPurtt itia} + àu” t! itih =0, 
where pj, /2 is approximated with a slope limited scheme. Substitution of 


urti = (m/p)et),, with m™+? given by (14.37), gives a linear system 


for dp. The equivalence with the classical incompressible pressure-correction 
method as M, | 0 is obvious. 


Numerical experiments 


The same test cases are computed as before, with the same values of h and À. 
Results are shown in Figs. 14.14-14.19. Shock resolution is a bit more crisp 
than for the first pressure-correction method. The accuracy for the moving 
contact discontinuity is much better, which is the reason for introducing this 
pressure-correction method. The density overshoot in the density for the test 
problem of Lax is still present, and the approximation of the expansion fan is 
less accurate. Grid refinement gives the more satisfactory results of Fig. 14.19. 
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Further discussion 


We have presented two pressure-correction methods, that reduce to the clas- 
sical incompressible pressure-correction method of Harlow and Welch (1965) 
as the Mach number tends to zero. This makes it possible to design solution 
methods for which the computing work is almost uniform in the Mach num- 
ber. 


The two pressure-correction methods use different formulations of the en- 
ergy equation. Although both formulations are non-conservative, weak solu- 
tions are found to be approximated correctly. But the first pressure-correction 
method may give rather large spurious oscillations for strong contact discon- 
tinuities, whereas the accuracy of the second method is much better for this 
case, and shock resolution is more crisp. 


In Sect. 14.5 results have been presented for the conservative Euler equations 
discretized on a staggered grid. These results illustrate the kind of accuracy 
that may be obtained for fully compressible flows on a staggered grid. The 
accuracy is found to be about the same as for the commonly used colocated 
schemes discussed in Chap. 10. Because the staggered schemes use simple 
central and upwind-biased differences to compute the numerical flux, they 
require less computing time than the colocated schemes, which employ more 
computing-intensive flux splitting methods or second and fourth order arti- 
ficial viscosity terms. Furthermore, for the staggered schemes the coupling 
between density, pressure and momentum is weaker, paving the way for more 
efficient solution methods for implicit schemes. 


It was found both for the colocated and for the staggered schemes, that for 
crisp resolution of discontinuities it is necessary to use both a second order 
slope limited scheme and a time stepping scheme that is stable for the x- 
scheme that corresponds to the limiter used. For example, the van Albada 
limiter corresponds to « = 1/2, as shown in Sect. 4.8. As shown in Chap. 9, 
the explicit Euler scheme is TVD, hence stable. But it is unstable for the 
«-scheme that corresponds to the limiter used, see Sect. 4.8. Hence, with the 
explicit Euler scheme the fruits of higher order spatial accuracy cannot be 
reaped, because the limiter will steer the scheme away from the «-scheme, to 
maintain stability. 


Extension of the staggered one-dimensional schemes for the Euler equations 
discussed in this chapter to more dimensions and to the Navier-Stokes equa- 
tions is straightforward, and can be carried out along the lines laid out 
in Chap. 13. This results in a unified method to compute incompressible 
and compressible flows. Details and results for the first pressure-correction 
method presented above are given in Bij] and Wesseling (1998), where ac- 
curacy and computing work are found to be approximately uniform in the 
Mach number. An extension to hydrodynamic flow with cavitation, involving 
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Fig. 14.14. Sod’s test problem with second pressure-correction scheme. 
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Fig. 14.15. Test case of Lax with second pressure-correction scheme. 
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Fig. 14.16. Mach 3 test problem with second pressure-correction scheme. 
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Fig. 14.17. Supersonic shocktube problem with second pressure-correction 
scheme. 
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Fig. 14.18. Moving contact discontinuity with second pressure-correction 
scheme. 


DENSITY VELOCITY PRESSURE 
1.6 2 4 
1.4 P 3.5 
1.2 1.5 3 
1 2.5 
1 
0.8 2 
0.6 05 1.5 
0.4 1 
0.2 0 0.5 
0.5 1 0 0.5 1 0 0.5 1 
MACHNUMBER ENTROPY Staggered scheme 
1.4 2.5 
12 Pressure correction scheme 2 
2 
1 SHK Runge-Kutta 
0.8 1.5 van Albada limiter 
0.6 1 t=0.141 
0.4 lambda = 0.26 J = 97 
0.5 
0. 
0 0 
0 0.5 1 0 0.5 1 


Fig. 14.19. Test case of Lax with h = 1/96. 
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a nonconvex equation of state, is given in van der Heul, Vuik, and Wesseling 
(1999). Here the Mach number ranges from 107° to 25, which makes a unified 
method for incompressible and compressible flows indispensable. 


Extension of incompressible staggered schemes to the compressible case has 
also been undertaken in Harlow and Amsden (1968), Harlow and Ams- 
den (1971), Issa, Gosman, and Watkins (1986), Karki and Patankar (1989), 
McGuirk and Page (1990), Shyy and Braaten (1988), Van Doormaal, Raithby, 
and McDonald (1987). In these methods the singularity at M = 0 is not re- 
moved, as we do by splitting the pressure according to (14.24). All generalize 
methods of SIMPLE or PISO type (Sect. 7.6) to the compressible case. It 
is shown in McGuirk and Page (1990) that the other methods give much 
shock smearing due to the fact that the velocity is updated in the pressure- 
correction step. This is improved in McGuirk and Page (1990) by updating 
the momentum instead of the velocity. In Shyy and Braaten (1988) better 
crispness is obtained by adaptive local grid refinement. All methods quoted 
derive the pressure-correction equation from the mass conservation equation, 
and are probably not very accurate for moving strong contact discontinuities. 
All use combinations of second order central and first order upwind schemes 
in space, and first order time stepping schemes. It is to be expected that 
the accuracy of these methods can be enhanced by use of second order slope 
limited schemes in space and higher order time stepping schemes. 


Clearly, unified methods for incompressible and compressible flows are within 
reach and may be expected to come into widespread use soon. 
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absolute stability, 168 

absolute temperature, 22 

acceleration method, 269 

accuracy, 163 
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acoustic modes, 569, 579 

acoustics, 35, 37, 570, 579, 581 

Adams-Bashforth scheme, 185, 186, 
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Adams-Bashforth-Crank- Nicolson 
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Adams-Bashforth-Euler scheme, 194 

ADI scheme, 331, 333, 334 

-- loss of accuracy, 333 

adiabatic, 27, 532 

— flow, 23, 24 

— process, 23 

aerothermochemistry, 3 

amplification factor, 173, 219, 220 

amplification matrix, 313 

amplification polynomial, 199 

approximation property, 287, 288 

Arakawa grids, 328 

Archimedes, 564 
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— of cell, 476, 477 

— of cell face, 482 

Aristotle, 43 

artificial compressibility method, 240 

artificial diffusion, 223, 574, 575, 577 

artificial dissipation, 389-391 

artificial viscosity, 127, 388, 389, 568 

artificial viscosity scheme, 381, 388 
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asymptotic expansion, 52 
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backward error analysis, 108 

backward facing step, 245, 562 

barotropic, 25, 26, 28-30, 32 

barrier function, 138, 140 

BDF scheme, 190, 218 

— extrapolated, 218, 226 

Beam-Warming scheme, 349 

benchmark problem, 561-563 

Bernoulli’s law, 437, 448 

Bernoulli’s theorem, 27, 31, 248 

— applied to chimney problem, 248 

Bi-CGSTAB method, 278, 284 

— postconditioned, 285 

bi-conjugate gradient stabilized 
method, 284 

bicharacteristics, 507, 508 
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Blasius resistance formula, 38 

block-centered discretization, 102 

body force, 14, 20 

bottom friction, 51, 52, 309, 315, 323, 
326, 327, 334, 336, 338 

bottom roughness, 51 

boundary condition, 61, 62, 65, 69, 75, 
89, 229, 237, 243, 244, 508, 532, 535, 
550, 582, 584, 591 

~ absorbing, 212 

- artificial, 350, 351, 355, 521 

- far-field, 522 

- for convection equation, 350 

— for Euler equations, 402 

~ for shallow-water equations, 310, 327 

— free surface, 551, 554 

— numerical, 518, 519 

— periodic, 213 

- solid boundary, 520 

— subsonic inflow, 519 

- subsonic outflow, 520 

- supersonic inflow, 519 

— transparent, 212, 214, 230, 521 
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- weakly reflecting, 212, 328 
boundary layer, 2, 32, 70, 75, 76, 111, 
116, 120, 127, 142, 224, 532, 567 

— equation, 72, 73, 76-78 

— numerical, 121, 211 

— ordinary, 76, 78, 146 

- parabolic, 76, 78, 135, 146 
— sohition, 76 

boundary- fitted coordinates, 467, 470, 
538 

Boussinesq 

— closure hypothesis, 41, 42 

- equations, 45-47, 563 

~- dimensionless, 46 

Buckley-Leverett equation, 371, 373 

buoyancy, 563 

buoyant flow, 228 

Burger number, 327 

Burgers equation, 363, 365, 367, 368, 
371, 374, 376 

butterfly, 42 


calorically perfect, 23, 26, 30 

Cartesian tensor notation, 5, 228 

Cauchy-Riemann equations, 59 

causality principle, 367 

cavitation, 13, 597 

cavity flow 

- buoyancy driven, 563 

~ lid-driven, 563 

celerity, 309 

cell area, 476, 477 

cell aspect ratio, 263, 270, 297, 302 

cell edge vector, 476 

cell face area, 482 

cell face vector, 482, 483 

cell volume, 480, 483, 484 

cell-centered, 94, 95, 100-103, 114, 116, 
120, 124, 138, 232, 237, 241, 511, 523 

centigrade scale, 22 

central discretization, 114, 128, 146 

centrifugal acceleration, 48, 49 

CFL number, 191, 210, 222, 313, 342, 
378, 379, 384, 393, 395, 446 

CG method, 278, 282, 284 

~ preconditioned, 282, 283 

~ rate of convergence, 281 

Chézy coefficient, 51, 305, 324 

chaos, 18 

chaotic dynamical system, 37 

characteristic equation, 185 

characteristic function, 86 

characteristic surface, 56, 307, 507 


characteristics, 71, 75-78, 306, 307, 
309, 310, 325, 339, 350, 363, 376, 
400-403, 406 

checkerboard mode, 235, 537 

chimney problem, 247 

Cholesky decomposition incomplete, 
284 

Christoffel symbol, 497, 543, 548, 557 

circulation, 11, 28, 64, 522 

circulation theorem, 28 

classification 

— of partial differential equations, 54, 
58 

— of shallow-water equations, 306, 324 

Clebsch 

— potential, 7 

— representation, 7 

closed system, 23, 24 

closure problem, 41 

coarse grid correction, 287 

colocated, 232, 240, 242, 243, 261, 262, 
302, 467, 484, 533, 535, 537, 538, 
507, 585, 597 

compatibility condition, 63, 231, 252 

compatibility relations, 238, 509 

compressible, 30 

computing cost, 263, 269 

- CG method, 282 
— of basic iterative method, 269 

— of fast Poisson solver, 293 

— of GCR method, 275 

— of Krylov method, 285 

- of multigrid, 288, 289 
~ uniform, 122, 127, 146 

concave, 371 

conformal mapping, 64, 470 

conjugate gradient method, 278 

~ optimality property, 279 

conservation 

~ of energy, 19 

— of mass, 12 

- of momentum, 13, 14 

conservation form, 113 

conservation law, 2, 11, 22, 339 

— scalar, 361 

conservative, 114 

conservative force, 27-30 

conservative scheme, 373, 374, 376, 377, 
414 

consistency, 167, 168 

consistent, 101, 102 

constitutive relation, 15 


contact discontinuity, 406, 407, 409, 
423, 455, 525, 585, 592, 596, 601 

continuity equation, 12 

— depth averaged, 48 

~ depth-averaged, 49 

continuum hypothesis, 2, 3 

contraction, 495 

contractive scheme, 356, 360 

contravariant 

— base vector, 476, 477, 491, 492, 539, 
540 

— components, 491 

~ order, 488 

— tensor, 487 

— transformation law, 487, 491 

— vector, 487, 490 

convection equation, 70, 340 

convection-diffusion equation, 4, 33, 34, 
53, 99, 111, 133, 174, 228, 234, 235, 
269, 532, 535, 565 

~ nonstationary, 163 

convergence, 165, 167, 168 

convergence analysis, 236, 252 

convergent, 101 

convex, 371 

coordinate invariant, 543, 548 

coordinate system 

— boundary-fitted, 53 

— inertial, 47 

— moving, 47 

— non-orthogonal, 53 

- rotating, 47 

coordinate transformation, 53, 485 

coordinate-invariant, 557 

coordinates 

— admissible transformation, 486 

- boundary-fitted, 470 

- Cartesian, 5, 7, 491 

- curvilinear, 7 

— general, 484, 491 

— orthogonal, 470 

— right-handed, 5, 476 

Coriolis 

— acceleration, 47, 49, 51, 52, 317, 324, 
334 

~ parameter, 324, 327 

Courant-Isaacson-Rees scheme, 349, 
360, 375, 427, 436 

covariant 

— base vector, 476, 491, 492, 539 

~ derivative, 498, 499 

— order, 488 

— transformation law, 492 
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— vector, 487, 490 

Crank-Nicolson scheme, 204, 249, 252, 
255, 258 

credibility, 559 

curl, 6, 8, 500 

cyclic order, 493 

cyclic reduction, 263, 292 


Darcy equations, 3 

defect correction, 129, 132, 224, 225, 
268 

depth-averaged 

- continuity equation, 48, 49 

— momentum equations, 49, 51 

— velocity components, 48 

Descartes, 1 

determinant, 8 

determinism, 18 

diagonal dominance 

— weak, 267 

difference 

— backward, 89 

— finite, 89 

— forward, 89 

differential-algebraic system, 239, 244, 
251 

diffusion, 57 

— numerical, 562 

diffusion coefficient, 34 

diffusion equation, 55, 61, 65 

— rotated anisotropic, 269, 297, 303 

diffusion number, 210 

dimensionless, 111 

— Boussinesq equations, 46 

— energy equation, 47 

— equations, 16, 570, 578 

— Navier-Stokes equations, 17, 228 

— variables, 16 

direct numerical simulation of 
turbulence, 38, 189, 223 

— computing cost, 38 

~ databases, 39 

Dirichlet, 62, 66, 76, 79, 89, 93, 95, 107, 
112, 121, 133, 229 

discretization error 

— local, 186 

— temporal, 186 

dispersion, 213, 319 

— analysis, 320 

— numerical, 321, 329, 336, 356-358 

— relation, 309, 320-323, 328, 356 

dissipation, 21, 213, 319 

- artificial, 389-391 
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— numerical, 320-322, 336, 356-358 
dissipative scheme, 171, 351, 355, 360 
distinguished limit, 74, 75, 78 
distribution matrix, 295, 299, 302 

— of SIMPLE type, 299, 302 
distributive 

— Gauss-Seidel method, 298 

- ILU method, 299 

— iteration, 251, 294, 295 
distributive formulation, 296, 302 
divergence, 500 

divergence theorem, 5 

domain decomposition, 472, 503 
domain of dependence, 311 

domain of influence, 311 
doubly-ruled surface, 474, 479, 483, 539 
drag, 32 

dual time stepping, 576, 577 

dummy index, 486 


eddy viscosity, 41, 42 

efficiency, 163, 224, 263, 270, 297-299, 
302, 303 

— of multigrid method, 288 

elliptic, 54, 55, 58, 70, 112, 532 

energy 

— equation, 20, 21, 24, 27, 32, 33 

— internal, 19, 21, 22 

~ kinetic, 19 

— total, 20 

Engquist-Osher scheme, 375, 377, 378, 
384, 425 

enthalpy, 22, 26, 27, 32 

- stagnation, 31 

— total, 27 

entropy, 22-24, 33, 400, 401, 405, 407, 
411 

~ condition, 366, 367, 370, 371, 374, 
376, 377, 380, 385, 405, 406, 423, 425, 
427, 432, 450, 455, 456, 466, 585 

--— Oleinik’s formulation, 367 

~ solution, 367, 368, 376, 377 

equation of state, 22, 398, 504, 580 

— dimensionless, 581 

— nonconvex, 601 

ERCOFTAC, 4, 560 

error 

— discretization, 563, 564 

- estimate, 69 

- global, 69 

— iterative convergence, 563 

- modeling, 559 

— numerical, 559 


- rounding, 563 

Euler 

~ equations, 32, 397, 503, 567, 568 

—— asymptotic expansion, 579 

~-— boundary conditions, 402 

-— dimensionless, 570, 578 

--— nonconservative, 399, 504, 570, 578 

— explicit scheme, 185, 204, 313, 576 

- implicit scheme, 204, 313, 318, 320, 
576 

Eulerian formulation, 3 

existence, 62, 83, 231 

expansion fan, 370, 380, 385, 407-410, 
423 

expansion shock, 380, 385, 423, 425 

expansion wave, 406 

explicit scheme, 171 

extrapolated BDF scheme, 197 

extremum 


- local, 68, 118, 121, 124 


fan, 370-373, 379 

fast Fourier transform, 292, 293 

fast Poisson solver, 263, 278, 292, 303 

Favre average, 40 

Fick’s law, 34 

finite difference, 4, 81, 84, 87, 91, 95, 99 

finite element, 4, 557 

finite volume, 4, 53, 81, 84, 86, 87, 91, 
95, 99, 100, 114, 135, 232, 233, 241, 
243, 414, 511, 533, 536, 543, 549, 
584, 590 

first law of thermodynamics, 19, 20 

first order upwind scheme, 148, 176, 
237 

FISHPACK, 264 

flow separation, 32 

fluctuation, 40 

fluid interface, 3 

flux, 100, 114 

- convective, 100 

- viscous, 100 

flux limited scheme, 154, 158, 225, 381 

flux limiter, 153 

flux limiter diagram, 157 

flux limiting, 148, 150, 360, 380 

flux splitting, 98 

flux splitting scheme, 384, 436, 441, 
445, 456, 512, 526, 527, 569 

flux vector, 34 

FOC scheme, 176, 178, 180, 182-184, 
193, 195, 197 

Fourier 


t 


analysis, 168, 171 

— discrete transform, 172 

— fast — transform, 292, 293 

integral, 351 

- law, 20, 34 

— smoothing analysis, 291 

- stability analysis, 253 

- transform, 174, 176 

fourth order central scheme, 150, 176, 
193 

fractional step method, 251 

free convection, 43, 46 

free index, 486 

free surface, 3, 48 

free surface condition, 230, 238, 551, 
554 

friction, 17 

frictional heating, 21, 24, 25 

Fromm scheme, 149, 158, 160, 348, 358 

Froude number, 308, 309 

frozen coefficients method, 171 

function space, 62 
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gasdynamics analogy, 52 

Gauss-Seidel 

— alternating damped zebra, 270 

— alternating symmetric line, 270 

Gauss-Seidel method, 268, 269, 292, 
301 

- backward, 273, 283 

- distributive, 298 

— forward, 273, 283 

- nonlinear, 290, 298 

— symmetric, 283 

— symmetric coupled, 300 

GCR method, 273, 275, 278, 282, 284 

— acceleration by —, 276 

— preconditioned, 276 

— rate of convergence, 275 

— restarted, 275 

— truncated, 275 

Gear scheme, 190 

generalized conjugate residuals method 
273 

generalized minimal residual method, 
276 

geometric identity, 478 

geophysical fluid dynamics, 305 

geophysics, 49 

Gibbs equation, 23 

GMRES method, 276, 278, 282, 284 

~ preconditioned, 277, 303 

GMRESR method, 276, 278 
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Godunov 
- flux, 380, 396 
— order barrier theorem, 340, 341, 344, 
346, 359, 378 


~ scheme, 379, 380, 414, 415 
Gram-Schmidt orthogonalization, 274 
Grashoff number, 46, 228 

gravity, 43, 48 


gr 

- boundary-fitted, 293, 303, 468, 469, 
471, 472, 503, 517, 527-529, 531, 557 

- Cartesian, 468, 469, 557, 562 

~ colocated, 467 

— generation, 472 

—- boundary-fitted, 473 

— multi-block, 562 

~ orthogonal at boundary, 551 

— staggered, 467 

— structured, 467-469, 503, 511, 531 

— unstructured, 468, 469 

grid refinement 

- adaptive, 122 

— local, 122 

group velocity, 309, 320, 327, 356 

— numerical, 320, 356 


Hadamard’s problem, 63 

Hansen scheme, 316, 320, 322, 329 

harmonic average, 87 

heat 

— conduction, 25 

— conductivity, 36 

— flux, 20 

- transfer, 22, 25 

—— equation, 25 

Helmholtz 

- equation, 64 

- representation, 7 

Heun 

- method of, 387, 458 

hexahedron, 469, 470, 478, 503 

high performance computing, 264 

homentropic, 24, 26, 30, 32, 33 

homogeneous transformation, 487, 488 

hybrid scheme, 128, 132, 205, 233, 237 

hydraulics, 305 

hydrostatic equilibrium, 44, 45 

hyperbolic, 54, 55, 58, 70, 72, 76, 112, 
305, 306, 325, 339, 400 

— strictly, 400 

hyperbolic system, 505 

hypersonic, 3 


ICCG method, 284 
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ideal fluid, 27, 28, 32 

ill-conditioned, 568 

ill-posed, 63, 116 

ILU, 270, 273, 278, 284, 292, 299, 300 

— distributive, 299, 301 

IMEX scheme, 196, 197, 224, 256, 258, 
557 

IMPES model, 83 

incomplete factorization, 284 

incomplete LU factorization, 299 

incompressibility, 13 

— conditions for, 36, 37 

incompressible, 3, 12, 29, 34, 36 

inconsistent, 101, 103 

inertia, 14, 17 

inertial frame, 47 

inflow boundary, 76, 77, 98, 116, 117, 
532 

inflow condition, 247 

initial condition, 64 

inner equation, 73, 74 

inner solution, 73, 74, 142 

instability, 164 

instable, 119, 169, 313 

interface problem, 82, 84, 87, 88 

internal energy, 19, 22 

Internet, 4, 264, 293, 472, 560 

interstellar medium, 2 

Invariance, 485, 487 

invariant form, 484, 485, 549 

inverse monotonicity, 368 

inviscid, 30 

irreversibility, 21, 57, 65, 367 

irrotational, 6, 29, 30 

isentropic, 24, 33, 34, 36 

isothermal, 25 

iteration matrix, 265 

iteration method, 263 

- stationary, 264, 269, 273, 276, 278, 
282, 286, 291, 297 

—- accelerated, 269 

—~ multigrid acceleration of — , 291 

ITPACK, 264 


Jacobi method, 268, 269 

~ damped alternating line, 270 

Jacobian 

— of coordinate mapping, 8, 476, 477, 
480, 485, 492 

— of flux function, 398, 505 

— spectral number, 568 

Jameson-Schmidt-Turkel scheme, 388 
447, 569 
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JST scheme, 388, 447, 449, 450, 455, 
456, 503, 525 

- TVD version, 391 

jump condition, 83, 364-366, 371, 373, 
374, 384, 395, 403 


kappa scheme, 148, 149, 158, 175, 176, 
178-180, 182-184, 197, 206, 209, 225, 
234, 237, 242, 255, 257, 347-349, 355, 
357, 359, 360, 380, 381, 457, 459, 
528, 529, 537, 597 

Karman vortex street, 563 

Kelvin 

— circulation theorem, 28 

- scale, 22 

kinetic energy, 19 

Knudsen number, 2 

Kolmogorov scale, 38 

Kronecker tensor, 489 

Krylov subspace, 271 

- optimal approximation, 273 

Krylov subspace acceleration, 298 

Krylov subspace method, 269, 270, 273, 
275, 285, 290, 303 

— acceleration, 273 

— acceleration of multigrid by — , 291 

— preconditioned, 278, 284, 293 

— preconditioned with multigrid, 303 

— rate of convergence, 272 

Kutta condition, 64 


Lagrange multiplier, 249 

Lagrangean formulation, 3 

laminar, 3, 4, 42 

LAPACK, 264 

Laplace, 18 

- equation, 30, 55, 59, 66 

Laplacian 

- in general coordinates, 501 

large-eddy simulation of turbulence, 39, 
189, 191, 223, 254, 292 

— computing cost, 39 

— databases, 39 

latitude, 49 

Lax’s equivalence theorem, 168 

Lax-Friedrichs scheme, 349, 360, 375, 
378, 396 

Lax-Wendroff scheme, 348, 355-360, 
380, 388 

Lax-Wendroff theorem, 376 

leapfrog scheme, 314, 321, 323 

leapfrog-Euler scheme, 185, 191, 253, 
255 

Leendertse scheme, 319, 322, 331, 334 


length scale, 163, 208, 222 

Leonardo da Vinci, 18 

lift, 32, 522, 567 

limiter, 155, 382, 390, 391, 395, 457 

— minmod, 161 

- PL, 161 

- superbee, 159 

-- van Albada, 159, 457, 527, 529, 597 
— van Leer, 159 

limiting, 209, 234 

linear transformation, 487, 488 
Liou-Steffen scheme, 441, 466, 516, 524 
local grid refinement, 120-122, 127 
local mesh refinement, 119, 148 

LU factorization, 268 


M-matrix, 97, 98, 104, 225 

MAC scheme, 540 

MacCormack scheme, 375, 380 

Mach 

— conoid, 507 

- number, 3, 31, 37, 567 

mapping 

~ boundary-fitted, 470 

— conformal, 470 

~ orthogonal, 471 

— piecewise multilinear, 472, 474 

— piecewise trilinear, 478, 538 

marker-and-cell scheme, 540 

matched asymptotic expansion, 142 

matching principle, 73, 75, 76, 78 

material 

~ contour, 11 

- curve, 28, 29 

— particle, 3, 9-12, 19, 20, 24, 29, 33, 
34, 48 

— property, 9, 33 

~ surface, 29 

- volume, 10, 14, 20 

matrix 

— adjoint, 331 

~ distribution, 302 

— Hessenberg, 277 

— irreducible, 267, 268 

- K-, 267, 268, 294, 302 

— M-, 266-268, 278, 291, 294, 299 

— normal, 265, 331 

- unitary, 272, 331 

maximum 

— local, 67, 69, 124 

— principle, 53, 66, 67, 69, 104, 116, 
119, 123, 124, 132, 138-140, 146, 163, 
170 
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maximum modulus principle, 220 

mean, 40, 41 

— density-weighted, 40 

mean free path, 2 

mesh Péclet condition, 96, 125 

mesh Péclet number, 97, 104, 117, 120, 
121, 126, 128, 180, 209, 213, 215, 222 

metric tensor, 494 

— contravariant, 494 

- covariant, 494 

— covariant derivative, 499 

— mixed, 494 

minimum 

- local, 68 

— principle, 69, 170 

mixed derivative, 53, 270, 293, 297 

mixed method, 191 

momentum equation, 14 

- depth-averaged, 49, 51 

Monge 

— conoid, 507 

— potential, 7 

monotone, 116, 118, 122, 125, 126, 340 

~ scheme, 69, 377, 380, 382, 384 

— solution, 392 

monotonicity preserving, 339-341, 343, 
344, 346, 368, 369, 382 

multigrid, 269, 270, 285, 292, 293, 295, 
300, 301, 303, 469 

— acceleration, 298, 303 

- approximation property, 298 

— as preconditioner, 291 

- computing work, 288, 289 

- efficiency, 288 

— full, 289 

— Galerkin coarse grid appoximation, 
300 

— nonlinear — method, 290 

~ smoothing factor, 299 

— smoothing property, 298 

— storage requirements, 289 

multilevel method, 285 

multiphase flow, 2, 3, 12 

MUSCL, 380, 456, 458, 459, 512, 518, 
523, 527, 528, 589 


Navier-Stokes equations, 3, 15, 227, 
531, 533, 535, 543, 567 

— dimensionless, 17, 228 

— Reynolds averaged, 40 

nested iteration, 289 

Neumann, 62, 68, 69, 79, 90, 93, 95, 
107, 112, 116, 120, 133, 229, 532 
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Newton, 1 

~ Principia, 1 

— stress hypothesis, 1 

Newton’s method, 234 

Newtonian fluid, 15 

no-slip condition, 51, 229, 245, 524, 
532, 550, 562 

nonconservative scheme, 373, 374, 376 

nonconservative variables, 399 

nonmonotone scheme, 377 

norm 

- ly, 88, 172 

- lp, 169 

- maximum, 88, 122 

— spectral, 265 

normalized variable, 234 

— diagram, 150, 151, 157 


Oleinik 

~ entropy condition, 367 

omega scheme, 202, 204, 208, 218, 221, 
223 

— multistage, 219-221, 223, 224, 256, 
258 

operator splitting, 259 

order barrier, 124, 125, 127, 148, 150, 
339 

— for multistep schemes, 343 

- of Dahiquist, 217 

- of Godunov, 340, 341, 344, 346, 359, 
378 

— theorem, 340, 341 

orthogonal matrix, 509 

Oseen equations, 19 

Osher scheme, 425, 428, 430, 442, 455, 
456, 514, 525 

- H-variant, 432 

- O-variant, 432, 466 

— version of Hemker and Spekreijse, 
431 

outer equation, 73, 76~78, 142 

outer product, 7 

outer solution, 73-75, 77, 142 

outflow boundary, 76, 99, 113, 133, 135, 
532 

outflow condition, 79, 111, 211, 212, 
230, 246, 247, 249, 351, 556, 562 


Péclet 

— mesh — condition, 96, 125 

— mesh — number, 97, 104, 117, 120, 
121, 126, 128, 180, 209, 213, 215, 222 

- number, 34, 112, 122, 127, 134, 224, 
228 


— uniform, 122 

— uniform accuracy, 124, 127, 134 

— uniform computing cost, 124, 127, 
134 

parabolic, 54, 55, 58, 60, 66, 70, 72, 78, 
112, 229, 532 

parallel computing, 472 

parallel processing, 264, 503 

parallelepiped, 480 

Parseval’s theorem, 172 

particle path, 9 

path integral method, 533, 537, 545, 
550, 556 

perfect gas, 22-26, 32, 35, 398, 400, 
504, 580 

permutation 

— even, 7, 493 

— odd, 7, 493 

— symbol, 7, 493 

phase velocity, 35, 309, 327, 356 

~ numerical, 320, 356 

physical component, 495 

physical units, 16 

PISO method, 259, 260, 296, 298, 601 

plane wave, 56, 58, 507 

Poincaré wave, 327 

Poisson equation, 251 

porous media, 3, 81, 83, 109, 371 

positive scheme, 124, 339 

positive type, 122-127, 138, 148, 150, 
154, 155, 157, 170, 171, 205, 208, 
214, 215, 225, 266, 268, 340 

postconditioning, 273, 276, 284, 294, 
296 

potential, 6, 27, 29 

- equation, 30 

~~ full, 31 

— flow, 6, 29-32, 64 

—- compressible, 30 

Prandtl number, 47 

Prandtl-Meyer relation, 403 

preconditioner, 269, 270, 278, 285 

— left, 273 

- right, 273 

preconditioning, 273, 276, 284, 296 

— matrix, 571 

~ of equations of motion, 568, 571, 574 
577 

predesign, 32 

predictability, 42 

— horizon, 42 

— of dynamical systems, 18, 42 

pressure equation, 83 


1 


pressure matrix, 278 

pressure Poisson equation, 251 

pressure-correction method, 251, 257, 
258, 303, 538, 556, 590, 596, 597 

pressure-weighted interpolation, 237, 
302, 537 

prism, 469 

projection method, 251 

prolongation, 286 

propagation, 57 

PWI method, 237-239, 262 


QR factorization, 277 
quadrilateral, 469, 483 
QUICK scheme, 149, 158, 348 
QUICKEST scheme, 348 


random fluctuations, 37 

Rankine-Hugoniot conditions, 403 

RANS, 40 

rarefaction fan, 370 

rarefied gas, 2 

rate of convergence, 265 

rate of strain, 15 

Rayleigh number, 228, 247 

re-entry, 2 

reducible curve, 29 

relaxation parameter, 263, 269 

— optimal, 269 

reservoir engineering, 3, 81, 83, 102, 
109 

residual, 261, 266 

residual averaging, 393 

restriction, 286 

reversibility, 21 

reversible process, 21, 23 

Reynolds 

~ average, 40 

— averaged Navier-Stokes equations, 
40, 41 

— decomposition, 40 

— number, 3, 17, 25, 37, 122, 127, 134 
148, 228, 270, 532, 562, 563 

--— macroscale, 38 

— stress, 41, 42 

— transport theorem, 10 

rheology, 15 

Ricci’s lemma, 488, 496, 499, 501 

Richardson extrapolation, 564, 566 

Riemann 

- approximate solver, 380, 382, 388, 
414, 415, 512, 527, 569 

- Invariant, 309, 310, 400-402, 407, 
509, 518-521 
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— problem, 370, 371, 379, 408, 409, 411, 
415, 512, 513, 585, 592 

- solution, 371, 373 

right-handed, 6, 7, 480, 485 

Robin condition, 62, 112, 229 

robustness, 263, 269, 270, 275, 297-299, 
302, 303 

Roe 

- average, 419, 512, 575 

— flux, 417, 420, 425, 431 

— matrix, 417, 419 

— scheme, 236, 416, 417, 420, 423, 432, 
442, 456, 512, 514, 525, 574 

—-— sonic entropy fix, 423, 459, 514 

Rossby number, 324 

rotating cylinder, 1 

rotating frame, 49 

rotation, 6 

rotation of the earth, 49 

Runge-Kutta method, 199, 218, 391, 
449, 589 

- amplification factor, 199 

— Heun, 387, 458 

~ mixed, 202 

— of Wray, 199, 202, 257, 258 

- SHK, 201, 459, 528, 585, 591, 596 

- stability analysis, 199 

— stability domain, 199 

- TVD, 387, 388, 393, 458, 528 

- WLM, 202 

Runge-Kutta-Crank-Nicolson scheme, 
257 


scalar, 489 

— absolute, 489 

— relative, 489 

SCGS method, 300, 301 

Schur-Cohn theory, 205 

Schwartz inequality, 174, 178 

second law of thermodynamics, 23 

separable partial differential equation 
293 

separation, 567 

separation bubble, 562 

shallow-water approximation, 48, 49 

shallow-water equations, 3, 4, 33, 48, 
52, 305, 339 

- boundary and initial conditions, 310, 
327 

— linearized, 308 

shear stress, 496 

Shishkin grid, 146 

SHK method, 201 
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shock, 32, 339, 363-367, 370, 371, 373, 


401, 403, 409 

— sensor, 389, 448 

shock tube, 408, 411, 432 

- equation, 410 

Sielecki scheme, 317, 320 

SIMPLE method, 296-298, 301-303, 
601 

simple thermodynamic system, 19, 22 

simple wave, 407 

SIMPLEC method, 296, 298 

SIMPLER method, 296 

simply connected, 63 

singular perturbation, 19, 53, 70-73, 
75, 111, 122, 142 

skew-symmetric, 105, 106 

slope limited scheme, 381, 383, 394, 
395, 456, 527, 562, 584, 585, 590, 
596, 597 

slope limiter, 153 

SMART scheme, 158 

smoother, 269, 270, 286 

smoothing, 289 

— analysis, 291, 295, 301 

— factor, 292 

— nonlinear, 290 

— property, 286, 288, 291, 292, 295 

Sobolev space, 82 

solenoidal, 6, 8 

solenoidality condition, 581 

solid boundary, 532 

sonic entropy fix, 425 

sonic glitch, 423, 425, 432, 442, 450, 
456, 466 

sound speed, 570 

sound wave, 37 

SPD, 278, 282, 283 

specific heat 

— at constant presure, 23 

— at constant volume, 23 

specific heats 

— ratio of, 24 

spectral method, 4 

spectral norm, 265 

spectral number, 568 

spectral radius, 265 

spectro-consistent, 105, 106, 109 

speed of sound, 13, 31, 35, 399 

splitting, 264, 276, 294, 296, 300 

— convergent, 266, 268 

— Gauss-Seidel, 299 

~ regular, 266, 268, 295 

spurious 


— mode, 194, 235-237, 240, 242, 537 

— oscillations, 148, 336, 538 

— reflection, 212, 351, 359, 585 

— root, 194 
- solution, 59, 60, 315, 316 

stability, 119, 166, 167, 169, 170, 176, 
208, 224, 227, 253, 376, 392-394, 457, 
458, 585 

~ A-, 217, 218, 220 

— absolute, 168, 171, 174, 221 

- analysis, 168, 318 

-- for the shallow-water equations, 
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- condition, 174, 186 

- domain, 174, 177, 185, 187, 189, 190, 
192, 194, 199, 201, 203, 217, 352, 391, 
449, 459, 528 

~ Fourier analysis, 171, 175, 253, 445 

— local, 171 

— numerical, 163 

- of Adams-Bashforth scheme, 188, 190 

- of Adams-Bashforth-Crank-Nicolson 
scheme, 197 

- of Adams-Bashforth-Euler scheme, 
195 

- of Courant-Isaacson-Rees scheme, 
354 

— of explicit Euler scheme, 186 

— of extrapolated BDF scheme, 198 

- of Hansen scheme, 316, 329 

~ of implicit scheme, 330 

- of kappa-scheme, 353 

— of Lax-Friedrichs scheme, 354 

— of Lax-Wendroff scheme, 353 

- of leapfrog scheme, 315, 323 

- of leapfrog-Euler scheme, 192 

~ of Leendertse scheme, 319, 332 

- of omega scheme, 204 

- of WLM Runge-Kutta method, 203 

— of Wray’s Runge-Kutta method, 200 

- polynomial, 185, 187, 189-191, 218 

~ R-, 217, 218, 226 

- strong, 209, 217, 218, 220, 224, 256, 
258 

~ von Neumann, 174, 177, 184, 185, 
206, 256, 313, 442, 446, 526, 528 
zero-, 168, 171, 173, 221 

staggered, 240- 245, 249, 261, 262, 292, 
293, 303, 312, 328, 467, 538, 540, 
548, 557, 583-585, 589, 590, 597 

standard atmosphere, 17 

standard software, 264 

state variable, 19, 20, 22, 24, 25, 35 


stationary, 112 

stationary iterative method, 264, 273 

statistical average, 42 

Stelling scheme, 334 

stencil, 89, 549 

stiff differential equations, 569, 571 

Stokes 

- equations, 18 

— paradox, 18 

~ theorem, 6, 29 

stratified flow, 14, 43 

streakline, 9 

streamfunction, 227 

streamline, 9 

stress tensor, 14, 496, 498, 531 

stretched coordinate, 72, 77 

subscript notation, 5 

subsonic, 3, 25 

summation convention, 5, 473, 486 

supersonic, 3 

surface force, 14, 20 

Sylvester’s lemma, 61 

symbol, 174, 176, 178, 352, 391, 446, 
449, 459, 526 


templates, 264 

temporal accuracy, 186, 187 

tensor, 487 

- absolute, 488 

— contravariant, 487 

~ Kronecker, 489 

~ law, 487 

— metric, 494 

— mixed, 488 

— rank, 488 

- rate of strain, 15 

— relative, 488, 500 

— stress, 14 

- weight, 488 

tensor analysis, 467, 484, 485, 539, 543 

tensor calculus, 498 

tensor notation, 484 

termination criterion, 265, 266 

thermal conductivity, 20 

thermal expansion coefficient, 46 

time scale, 163, 208, 222 

time-like, 57 

topologically equivalent, 472 

total derivative, 9 

total energy, 20 

total enthalpy, 27 

total variation, 344, 345, 368, 393 

total variation diminishing, 339, 344 
368 


Ej 
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transitive transformation, 487, 488 

transonic, 3, 290, 522 

transonic rarefaction, 380 

transport theorem, 10, 12, 20 

Tricomi equation, 55 

truncation error, 98 

— global, 81, 102, 103, 106, 120, 122, 
139, 141, 149, 163, 166, 167 

— local, 81, 101-103, 106, 107, 126, 130, 
136, 149, 166, 343 

turbulence, 18, 37 

- direct numerical simulation, 38, 189, 
223 

-- computing cost, 38 

—- databases, 39 

- Kolmogorov scale, 38 

- large eddies, 38 

— large-eddy simulation, 39, 189, 191, 
223, 254 

—-— computing cost, 39 

--— databases, 39 

— model, 3, 4, 41, 42, 228 

-- algebraic, 42 

~~ computing cost, 42 

~— k-epsilon, 42 

—— k-omega, 42 

-- one-equation, 42 

—— Reynolds stress, 42 

—— two-equation, 42 

— subgrid-scale modeling, 39 

turbulent, 3, 4, 37, 122 

turn-around time, 264 

TVB, 382 

TVD, 344, 346, 368, 369, 382, 383, 390, 
391, 394, 455, 457-459, 527, 528, 597 

~ scheme, 339, 345, 384-386, 389, 395 

— time-stepping scheme, 393 

two-grid algorithm, 287 

— nonlinear, 290 

two-grid iteration, 287 


unified method for incompressible and 
compressible flow, 567, 589, 601 

uniformly valid, 75 

unique, 62 

uniqueness, 83, 231 

upwind discretization, 97, 98, 115, 
126-128, 146 

upwind scheme, 121, 122 


validation, 559, 566 
van Albada limiter, 585 
van Leer 
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— flux splitting, 438, 442 

- scheme, 436, 440, 455, 516, 524 

-- modification of Hänet c.s., 437 

vector field, 486 

vector notation, 5 

verification, 559, 563, 566 

vertex-centered, 81, 88, 95, 100-102, 
114, 120, 124, 138, 146, 232, 238, 241 

virtual 

— point, 90 

- value, 90, 95, 99 

viscosity, 4, 36 

— artificial, 113, 127, 381, 388, 389, 
447, 455, 525 

— coefficient, 15, 228 

— numerical, 109, 111, 122, 247 

viscous stress, 41 

volume 

— element, 7, 8, 480 

- of cell, 480, 483, 484 

— of parallelepiped, 7, 480 

von Neumann 

— condition, 173, 174 

— stability, 174, 177, 184, 185, 206, 256, 
313, 442, 446, 526, 528 

vorticity, 15, 29, 227, 325, 327 


wall-clock time, 264 
wave 


— dispersive —, 309, 310, 327 
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- equation, 35, 55, 60, 66 

- flood —, 310 

— front, 56, 325, 339 

— gravity —, 309, 310, 320, 322, 326, 
327 

~ nondispersive —, 309, 310 

~ Poincaré —, 327 

- tidal —, 334 

- vorticity —, 325-327 

wave-like, 57 

wavelength, 37, 52 

wavenumber, 172 

weak formulation, 82, 83, 86, 91 

weak solution, 364, 367, 373, 374, 403 

~- genuine, 373, 376, 384 

— uniqueness, 367 

weakly compressible, 589 

website, 4, 39, 264, 285, 560 

well-posed, 62, 63, 65, 66, 78, 246, 248, 
310, 402, 532, 580 

~ elliptic boundary value problem, 62 

wiggles, 53, 68, 111, 113, 118, 119, 121, 
124, 129, 132, 147, 150, 152, 209, 210, 
213-215, 217, 224, 225, 230, 246, 266, 
312, 359, 369, 380, 381, 389, 392, 
447, 450, 455, 456, 562, 565, 585, 592 


zero-stability, 168 
zonal method, 472 


